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ABSTRACT 

The  objective  of  this  study  has  been  to  obtain  and  evaluate  strategies 
to  be  used  in  certain  general  search  situations.  These  strategies  minimize 
the  expected  cost  of  search  and  resulting  decisions  and  are  sequential  in  the 
sense  that  a  decision  at  any  time  is  dependent  upon  what  has  been  observed 
up  to  that  time. 

The  first  situation  studied  leads  to  the  formulation  of  a  minimum 
expected  cost  sequential  hypothesis  test.  The  target  is  either  present  in 
the  region  of  interest  with  a  priori  probability  P  ,  or  not  with  probability 
1  -P  .  Knowing  the  value  of  P  ,  at  fixed  intervals  of  time  the  searcher 
must  either  make  a  terminal  decision  (i.  e.  decide  that  the  target  is  present, 
or  not  present)  or  make  a  measurement  of  a  random  variable  that  has  a 
probability  density  function  which  depends  upon  whether  or  not  the  +arget  is 
present.  A  cost  structure  is  given  which  assigns  costs  to  wrong  terminal 
decisions,  as  well  as  a  cost  (which  depends  upon  whether  or  not  the  target 
is  present)  for  the  taking  of  a  measurement.  The  sequential  strategy  and 
resultant  minimum  cost  are  derived  by  solving  a  functional  equation  of  the 
dynamic  programming  type.  The  relation  between  this  strategy  and  the 
Wald  sequential  probability  ratio  test  is  discussed.  The  minimum  cost  of 
the  strategy  is  compared  with  the  cost  of  an  often  used  non -sequential  strat¬ 
egy  as  well  as  a  class  of  sub -optimal  sequential  strategies  that  involve 
threshold  observations. 

The  second  part  of  this  study  involves  a  situation  in  which  it  is 
assumed  that  the  target  arrives  at  some  random  time  (the  "raid-recognition" 
problem).  A  cost  structure  is  given  which  assigns  a  cost  to  deciding  the 
target  has  arrived  when  in  fact  it  hasn't,  and  also  assigns  a  cost  proportional 
to  the  time  between  arrival  of  the  target  and  the  decision  that  it  has  arrived. 
Again  observations  of  a  random  variable  related  to  the  presence  of  the  target 
are  available  as  an  alternative  to  making  such  a  decision.  The  sequential 
strategy  and  resulting  minimum  cost  are  again  obtained  by  means  of  a  functional 
equation.  An  additional  result  is  the  formulation  of  a  System  Operating  Charac¬ 
teristic  that  is  used  for  this  randomly  arriving  target  model  in  a  way  similar 
to  the  use  of  the  Receiver  Operating  Characteristic  for  the  hypothesis  test 
model. 
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CHAPTER  I 


INTRODUCTION 


Much  work  has  been  done  in  the  past  20  years  on  analytical 
approaches  to  problems  dealing  with  search  and  detection.  The  purpose 
of  the  present  work  is  to  extend  some  of  these  problems  and  to  treat 
them  from  the  point  of  view  of  Statistical  Decision  Theory. 

In  particular,  the  often  separately  stated  problems  in  "search" 
and  "detection"  will  be  considered  as  part  of  an  overall  operation,  which 
must  consequently  be  optimized  as  a  whole. 

Most  people  have  found  themselves  in  the  position  of  having 
to  look  for  something.  The  process  of  looking  for  and  (possibly)  finding 
the  object  is,  in  the  sense  of  what  will  be  treated  in  this  paper,  that 
person's  solution  to  a  search  and  detection  problem.  Whether  that 
solution  is  a  "good"  one  or  not  depends  upon  the  criterion  of  goodness 
that  the  person  has  decided  upon.  Hopefully,  if  the  person  has  a  degree 
of  consistency  which  we  would  like  to  require  of  decision  makers,  he 
could  describe  his  scheme  of  operation  to  a  confederate,  and  be  confi¬ 
dent  that  the  subsequent  results  would  be  the  same  as  if  he  himself 
carried  out  the  process,  Xn  this  age  of  automated  dec»  don  making, 
the  confederate  is  very  often  a  computer,  and  the  description  of  the 
scheme  the  appropriate  program. 

One  object  of  this  paper  is  to  consider  a  class  of  decision 
problems  that  might  be  called  "sequential  seaich  and  detection"  problems, 
and  using  a  particular  utility  structure  associated  with  these  problems 
arrive  at  a  description  of  the  "optimal"  way  of  making  these  decisions: 
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optimal  in  the  sense  of  minimizing  the  overall  cost  of  the  search  process. 
In  addition  we  are  interested  in  examining  some  non-optimal  decision 
schemes,  some  existing  and  some  proposed,  that  do  not  minimize  cost, 
but  which  are  perhaps  easily  implemented,  or  have  intuitive  appeal,  or 
both. 

r~  x 

Before  proceeding  further,  it  is  convenient  to  define  the  terms 
"search"  and  "detection".  As  nouns  {and  adjectives)  they  have  often 
appeared  in  the  literature  as  synonyms,  but  for  the  purposes  of  this  paper 
there  will  be  a  clear  distinction  made  between  them. 

Detection  involves  the  gathering  of  information  pertaining  to 
the  object  being  sought  (the  target),  the  lifting  out  what  is  useful  informa¬ 
tion  and  the  relaying  of  this  in  some  efficient  form  to  a  decision  maker. 

Search  describes  the  decisions  made  on  the  basis  of  the 
detection  information  received.  In  particular,  a  "search  strategy'-  will 
be  that  set  of  rules  that  associates  decisions  with  every  conceivable 
result  of  the  detection  process. 

When  a  collar  button  falls  to  the  floor,  the  detection  device 
of  the  eye  picks  up  the  information  that  there  is  a  strong  glint  of  light 
under  the  bureau,  and  a  Ives  strong  one  under  the  bed.  The  search  phase 
is  the  decision  to  bend  down  and  reach  under  the  bureau,  and  if  unsuc¬ 
cessful,  to  then  reach  under  the  bed.  To  continue  the  homely  example,  the 
search  also  included  the  decision  to  bend  down  in  the  first  place,  rather 
than  shrugging  the  shoulders  and  taking  a  new  button. 


The  problems  of  search  are  decision  problems.  Where  should 
one  look,  for  how  long,  with  what  equipment?  The  answers  to  these 
questions  involve  decisions,  and  in  that  any  old  answer  is  not  acceptable 
we  recognize  the  fact  that  there  are  costs  involved  in  doing  the  wrong 
things:  making  the  wrong  decisions.  In  addition  we  often  must  use  detec¬ 
tion  devices  that  give  information  imperfectly,  perhaps  describabie 
statistically.  Thus  statistical  decision  theory  is  the  most  likely  can¬ 
didate  to  be  the  tool  with  which  to  attack  search  and  detection  problems. 

1 .  1  Some  definitions 

The  definitions  offered  below  are  solely  for  use  in  the 
context  of  this  work,  and  the  author  makes  no  claims  for  their  universal 
appeal  or  application. 

Target 

The  target  is  an  object  that  is  of  primary  interest  to  the 
decision  maker  (sometimes  called  the  "searcher").  At  the  starting 
point  of  the  search  process,  the  location  of  the  target  is  uncertain. 

The  general  object  of  the  search  process  is  to  increase  the  searcher's 
knowledge  of  the  location  of  the  target.  In  general,  the  target  may 
or  may  not  change  its  location  during  the  search.  It  will  be  assumed, 
for  the  purposes  of  this  work,  that  only  one  target  is  involved  at  any 
time.  This  assumption,  although  often  unrealistic,  helps  to  point  to 
good  procedures  even  in  the  cases  where  it  does  not  hold. 

Field  (F) 

The  field  is  the  area  within  which  the  entire  search  process 
takes  place:  the  region  of  interest  of  the  search. 
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in  most  searches,  the  field  is  broken  down  into  many  smaller 
non-overlapping  areas,  A.  ,  called  cells.  In  general  a  cell  is,  in  size 
and  shape,  the  resolution  element  of  the  sensing  device  being  used--- 
the  smallest  space  that  could  contain  two  targets  without  allowing  the 
detection  device  to  determine  whether  or  not  there  are  one  or  more 
targets  present.  In  the  discussion  that  follows,  i=l,  2,  .  .  . ,  M  ,  where 
M  is  the  number  of  cells  in  the  field.  The  cell  AQ  represents  the 
location  "nowhere  in  the  field". 

State  of  Nature  (S. ) 

The  state  of  nature  is  a  description  of  the  actual  location 
of  the  target.  The  abbreviation  S.  stands  for  the  state  of  nature: 

{the  target  is  present  in  cell  A^}  .  The  abbreviation  stands  for 
the.  state  of  nature:  {the  target  is  not  present  anywhere  in  F}  . 

A  Priori  Target  Location  Probability  Vector  (P) 

This  vector  summarizes  the  degree  of  uncertainty  concerning 
the  target  location  at  the  start  of  the  search,  where  F.  is  the  a  priori 
probability  that  8^  is  the  state  of  nature.  The  author  does  not  wish  to 
embroil  himself  in  arguments  concerning  the  existence  of  this  P  vector 
To  the  unconvinced  reader  two  suggestions  are  offered:  he  can  consider 
the  placement  of  the  target  to  be  determined  by  some  appropriate  randor 
experiment  (e.  g.  dice  throwing),  the  outcomes  of  which  can  be  associate 
with  the  P.  's  ;  or  he  can  replace  the  word  "probability "  by  "plausabilit 
throughout  the  remainder  of  this  work,  with  the  assurance  (for  example 
see  Jeffreys  (23))  that  the  mathematical  development  will  be  identical). 


Detection  Process 

The  detection  process  is  that  process  by  which  information 
(concerning  the  presence  or  absence  of  the  target  in  one,  some  or 
all  of  the  cells)  is  obtained  by  means  of  observation  or  measurements 
of  various  physical  phenomena  related  to  these  cells;  such  as  reflected 
energy,  radiated  energy,  etc.  The  apparatus  used  for  these  measurements 
is  called  the  "detection  device",  or  the  "receiver". 

Noise 

Noise  is  the  collection  of  those  factors  (random  or  other¬ 
wise)  that  make  the  detection  device  produce  target-like  signals  when 
the  target  is  in  fact  absent. 

Decision  (D,  or  W) 

A  decision  (D/f  is  a  commitment  by  the  searcher  to  take 

action  associated  with  the  belief  that  the  target  is  in  cell  .  Or 

it  may  be  the  commitment  to  wait  (W)  for  more  information  from 

the  detection  device.  The  decisions  D.  are  called  "terminal  decisions". 

1 

Search  Strategy 

A  search  strategy  is  a  set  of  rules  which  assigns  decisions 
to  all  possible  outcomes  of  the  detection  process. 

Right  Decision 

A  right  decision  occurs  when  D.  is  made  and  S.  is  the 
i  J  J 

state  of  nature,  written  {D,|S,}  , 

J  J 

Wrong  Decision 

A  wrong  decision  occurs  when  D.  is  made  anu  (i*j)  is 
the  state  of  nature,  written  {D.|S,},  (i*j)  . 
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CHAPTER  II 


BACKGROUND 


2.  1  General 

Published  work  on  the  subject  of  search  and  detection  seems 
to  fall  into  three  separate  groups,  with,  as  a  rule,  little  or  no  discussion 
of  the  relation  among  them.  Koopman  (26)  has  categorized  these  groups 
as  dealing  with  1)  Kinematics,  2}  Distribution  of  Search  Effort,  3)  Target 
Detection. 

Problems  concerning  1 ‘kinematics  M  have  to  do  primarily  with 
the  relative  motions  between  searcher  and  target.  Detection  is  assumed 
perfect,  once  an  interception  occurs,  and  the  problems  are  directed 
cowards  establishing  optimal  pursuit  and  evasion  strategies,  prediction 
of  target  course,  etc.  The  results,  which  are  rather  complete  and 
cover  a  wide  range  of  models,  are  to  be  found  primarily  in  Koopman 
(26,  27),  as  well  as  Gluss  (14,  15),  Danskin(ll),  Isbell  (22),  Banta  (1) 
and  others.  These  kinematic  considerations  are  outside  the  scope  of 
the  present  work,  inasmuch  as  the  problems  of  imperfect  detection  and 
possible  false  alarms  and  decisions  are  not  involved. 

The  results  of  previous  work  in  both  Distribution  of  Search 
Effort  and  Target  Detection  set  the  groundwork  for  the  present  paper, 
and  the  following  two  sections  develop  what  has  been  done  in  these 
areas. 
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2.2 


Distribution  of  Search  Effort 


Most  of  the  work  published  under  the  general  topic  of  "sear 
theory*' has  been  involved  with  Koopmans'  second  category:  distribut 
of  search  effort.  These  problems  in  general  assume  that  the  availabl 
detection  devices  are  ones  that  will  not  yield  target-like  information  5 
the  target  is  not  present  and  will  detect  only  some  fraction  of  the  tin- 
the  target  is  present.  This  fraction  is  called  the  detection  probabilit 

Thus  the  search  strategy  becomes  straightforward.  Whe  j 
target-like  information  is  received  from  ,  the  target  must  be  the. 

so  decide  D.  . 

1 

The  problem  thus  becomes  not  what  to  do  when  target-like 
information  is  received,  but  what  to  do  until  it  is  received. 

The  solutions  are  in  the  form  of  search  strategies,  invol\i' 
the  distribution  of  the  available  "magnitude"  of  search  effort  that  she 
be  placed  upon  the  cells.  It  is  assumed  that  relations  between  the 
amount  of  search  effort  used  in  observing  a  cell,  and  quality  of  detec 
tion  in  that  cell  (as  reflected  in  detection  probability)  are  known  quant 
tively.  The  distribution  of  effort  is  selected  to  optimize  some  entire 
measure  of  the  search,  such  as  maximizing  the  probability  of 
detecting  the  target  for  a  given  search  effort.  Another  criterion  (whit 
proves  to  yisld  identical  strategies  for  m'ost  models)  would  be  to  mini 
the  average  search  effort  heeded  to  eventually  detect  the  target. 

/  The  most  general  solutions  to  this  problem  have  been  obtait 

when  the  field  represents  a  continuum  (the  cells  becoming  differential 
areas),  by  Koopman  (32,  33)  and  de  Geunin  (22).  Other  cases,  involv 
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so  culled  "discrete"  or  finite  sized  cell  cases,  have  been  treated  by  Others 

(5,  6,  10,  13,  30,  35,  36,  37)  with  solution?  all  approaching  Koopman's  in 

* 

the  limit  as  cell  sizes  decrease.  However,  all  these  approaches,  as  pointed 
out  above,  neglect  the  possibility  of,  and  hence  the  cost  of  false  alarms  due 
to  noise. 


Target  Detection  -  STSD  and  Hypothesis  Testihs 


What  is  known  as  the  Statistical  Theory  of  Signal  Detection  (STSD) 
has  been  developed  in  the  past  decada,  primarily  by  Middleton  (33,  34). 
Helstrom  (19)  and  others  have  extended  this  theory. 


Hypothesis  testing  is  used  as  a  framework  upon  which  to  develop 
receivers  that  take  into  consideration  the  presence  of  noise  in  each  cell. 
The  theory  is  quite  elegant,  and  much  of  it  is  concerned  with  problems 
of  devising  measures  (or  statistics)  of  received  waveforms,  allowing  for 
the  possibility  of  noise  and  target  signals  of  all  statistical  varieties. 
However,  the  theory  tends  to  not  differentiate  between  search  and  detec¬ 
tion.  It  also  has  tended  to  ignore  the  potential  of  the  use  of  sequential 
rules  involving  costs  of  wrong  decisions.  In  Order  to  make  some  of  these 
points  clear,  as  well  as  to  lay  the  ground  for  a  more  general  extension 
of  the  theory,  let  us  consider  the  basic  ideas  involved  in  the  STSD. 


Consider  the  field  to  consist  of  only  one  cell,  .  Thus, 
there  are  two  possible  states  of  nature: 


Sq  s  target  is  not  present. 

Sj  s  target  is  present. 

It  is  assumed  that  the  target  is  stationary.  That  is,  only  one  of  these 
states  describes  the  target  position  for  the  entire  duration  of  the  search. 
The  target  cannot  move,  or  evade  the  searcher.  (This,  of  course,  limits 
the  generality  of  the  model,  and  the  eventual  lifting  of  this  condition  is 
one  of  the  aims  of  this  study. ) 
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Now,  upon  observing  the  cell  with  the  detection  device,  the 
searcher  can  make  one  of  two  decisions  ,  which  by  the  assumptions 
of  STSD  are  equivalent  to  taking  one  of  two  actions : 

Dq  ~  take  action  appropriate  to  target  being  absent, 
r  take  action  appropriate  to  target  being  present. 

Let  uc  represent  the  observation  by  x  {possibly  a  vector). 

If  the  observer  knows  from  experience  that  the  sensing 
device  operates  in  such  a  way  that  if  a  target  is  present  (S^)  the 
reading  would  be  ,  but  if  the  target  is  not  present  (Sq)  ,  the 
reading  would  be  xQ  ,  then  the  decision  is  sraight  forward: 

when  x  =  Xq  : 


(Note  that  a  reading  of  x  #  x^  or  x^  is  impossible.) 

A  more  realistic  measurement,  however,  is  subject  to 
random  fluctuations  due  to  many  incomprehensible  or  unpredictable 
effects  including  those  in  the  processing  equipment,  as  well  as  target 
characteristics,  errors  of  observation,  etc.  The  outcome  of  the 
obeervation,  x  ,  under  the  two  possible  hypotheses,  is  thus  a  random 
variable,  and  is  at  best  describable  by  known  probability  density 
functions.  If  these  probability  density  functions  are  completely 
described  and  involve  no  unknown  parameters  then  the  hypotheses 
are  called  "simple”.  This  more  realistic  example,  then,  is  the 
testing  of  two  simple  hypotheses,  where  the  outcome  x  will  have 
a  known  probability  density  function  p  (x)  under  hypothesis  Sn  , 
and  Pj  (x)  under  hypothesis  .“ 


♦ 


J 

“1- 


The  observer  must  be  able  to  decide  upon  either  Dq  or  Dj 
for  any  possible  observed  value  of  x  .  The  resulting  decision  rule 
divides  the  range  of  x  into  two  exhaustive  regions  Xq  and  ,  with 
the  result: 


if  x  eX, 


if  x  eX 


1 


(2.1) 


Because  of  the  probabilistic  nature  of  the  observation,  it 
is  now  possible  for  the  observer  to  make  a  mistake:  to  say  one 
hypothesis  is  correct  wtan  in  fact  the  other  is  actually  true.  A 
way  to  describe  the  possibility  of  a  wrong  decision  is  to  associate 
with  any  decision  rule  such  as  (2,  1)  above  two  error  probabilities. 
These  are  defined  as  follows: 


If  the  true  state  of  nature  is  described  by  ,  then  the 
probability  of  making  decision  Dj  (a  mistake)  is  defined  as  a  , 
or  "error  of  the  first  kind".  Written  in  the  usual  probabilistic 
notation,  a  =  prob.  {D^fs^  }  .  Correspondingly,  the  probability 
of  making  DQ  when  the  true  state  of  nature  is  in  fact  described 
by  Sj  is  defined  by  (3  or  "error  of  the  second  kind".  That  is, 

$  -  prob.  {Dq|Sj  }  .  In  the  usual  statistical  sense,  5^  is  said 
to  be  the  "null  hypothesis".  Our  choice,  in  this  detection  model, 
of  Sq  being  "target  absent "  is  completely  a  matter  of  convenience. 

In  the  detection  model  that  we  are  describing,  it  is  now 
a  conceptual  advantage  to  speak  of  a  detection  probability  and 
a  false  alarm  probability  p^  .  These  are  a  matter  of  definition, 
and  it  is  convenient  here  to  relate  them  to  a  and  (3  .  Thus, 

Pd  =  prob,  {DjlSj  hi  -  prob.  {  DQ  |  Sx  }  *  1  -  p 

Pf  a  prob,  {Dj  |SQ  }  =  a 
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(note  that  these  probabilities  are  strictly  conditional  upon  the  state 
of  nature,  and  that  conclusions  reached  by  consideration  of  these 
numbers  alone  must  in  themselves  be  conditional.  This  will  be 
discussed  in  the  next  section. ) 

In  terms  of  the  known  p.d.fs  (x)  and  p^  (x)  we  see 
that  a  and  £  are  given  by 


for  x  a;  x*  :  Dy 
x  >  x*  :  Dx  . 

Here  s  [  "*  *  x*  ]  and  Xj  =  [  x*  ,  oo  ]  .  The  error  probabilities 
would  be  then  given  by 


y._  — 


o=l-  erf  ( — ) 
<r 


For  constant  p  and  <r  ,  a  and  (3  are  related  to  the  parameter  x* 
so  that  as  x*  increases,  a  decreases  and  p  increases.  This  is 
shown  in  the  curve  s  in  Figur  e  2.  2,  which  are  for  cr  =  1  .  Curves  such 
as  these,  that  relate  a  to  p  (or  p^  to  p£)  for  a  particular  device 
are  called  ROC  or  Receiver  Operating  Characteristic  curves.  Figure 
2.2  shows  the  ROC  of  p^,  v.  a.  p£  with  p  as  a  parameter.  It  is 
sometimes  convenient  to  draw  the  ROC  as  a  plot  of  p^  v.  s.  p  ,  with 
p£  as  a  parameter,  where  p  is  in  general  some  .-measure  of  the  differ- 
between  the  p.  d.  f.  s  p^  (x)  and  p^  (x). 

At  this  point  we  have  not  yet  determined  what  the  observer 
is  actually  trying  to  do.  He  might  have  some  vague  ideas  about  achiev* 
a  "reasonable"  detection  probability  while  keeping  the  false  alarm  prob. 
bility  "low".  A  quantification  of  this  concept  is  the  subject  matter  of 
the  next  section.  '  However,  at  this  point,  we  can  agree  upon  some  as¬ 
pect  of  goodness,  so  that  we  can  immediately  compare  certain  sensing 
devices. 

Suppose  two  devices  are  available  to  use  tn  searching  a  cell. 
Device  A  has  p .  =  1  -  P  =  .8  ,  p  =  a  =  .01  ,  while  device  B  has 

O  X 

p  ■  1 .0  «  .  §  ,  p£  =  o  =  .  02  .  Clearly,  device  A  is  beter  than  device 
B  by  whatever  reasonable  criterion  is  used.  This  concept  is  formalize* 
in  the  following  way. 

Let  us  consider  two  search  devices  ,  d^  with  error 
probabilities  (o^  ,  pj)  and  (o^  ,  respectively.  Then  d^  is 
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Figure  2.  2  Receiver  Operating  Characteristic  (ROC) 
for  device  illustrated  in  Figure  2. 1 
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•aid  to  be  preferable  to,  or  better  than,  if  o2  and  Pj  2£  P2  . 

Thus,  if  all  possible  devices  are  represented  on  a  p  v.  s.  p,  ROC 

a  I 

curve,  any  given  device  is  better  than  those  that  are  "below  and  to 
the  right",  while  it  is  worse  than  those  that  are  "above  and  to  the 
left".  However,  if  Oj  Sa2  and  Pj  —  fchen  di  and  d2  are 
said  to  be  non-comparable,  ar,td  more  must  be  known  before  a 
comparison  may  be  made  between  the  two. 

Looking  at  the  ROC  in  Figure  2.2,  we  see  that  for  a  given 

value  of  p  ,  as  x*  is  varied  from  minus  infinity  to  infinity,  a  whole 

continuum  of  possible  devices  is  described,  ail  of  these  devices  being 

non -comparable  in  the  sense  given  above.  For  a  given  p  we  cannot 

yet  decide  which  among  them  is  best.  However,  if  we  compare  the 

curves  for  pj  and  p^  (p^  >  H-j)  we  see  that  for  any  point  (representing 

a  device)  on  the  p^  curve,  we  can  find  "above  and  to  the  left"  of  it 

some  point  on  the  p^  curve.  Thus,  an  increase  in  p  is  always 

desirable  in  the  system  as  illustrated  in  Figure  2.  2,  as  long  as  x* 

is  free  to  be  set. 

> 

In  the  statistical  literature,  a  is  called  the  "size"  of  a 
particular  test  or  experiment,  and  for  a  given  value  of  a  the  largest 
attainable  value  of  1  -0  ■  p^  is  called  the  "power  ", 

2.4  Decision  Criteria 

In  the  last  section,  we  have  seen  that  a  detection  device 
combined  with  a  decision  rule  may  be  operationally  described  by 
the  associated  error  probabilities  a  and  p  .  Let  us  call  a  combination 
of  a  detection  device  and  a  decision  rule  a  "system". 
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In  order  to  evaluate  a  system,  or  equivalently,  to  be  able 
to  compare  two  or  more  systems,  there  must  exist  some  measure 
of  the  effectiveness  of  the  entire  system  (including  the  specific  decision 
rule  used).  The  observer  must  be  searching  for  the  target  with  some 
idea  as  to  not  only  what  actions  will  be  taken,  but  the  costs  associated 
with  them,  if  he  detects  (or  thinks  he  detects),  or  does  not  detect  the 
target. 

The  object  is  now  to  determine  which  decision  rule  is  ,1bestM 
in  some  way.  For  example,  in  the  system  illustrated  in  Figure  2. 1 
what  value  of  threshold  x*  should  be  used? 

Since  it  is  reasonable  to  assume  that  the  search  system 
will  be  used  over  and  over  again  for  some  length  of  time,  systems 
may  be  compared  in  terms  of  their  overall,  long-term  average 
performance. 

For  each  state  of  nature,  the  cost  of  making  decision 
or  will  be  different.  In  general,  it  is  possible  to  list  the  costs 
of  all  possible  state — decision  situations  in  a  matrix  C; 


where  C, .  is  the  cost  of  making  decision  O,  when  in  fact  S.  is 
ij  *  i  J 

true.  (For  this  to  be  a  reasonable  matrix  Cqq  <  <  Cjq  : 

the  cost  of  making  a  correct  decision  is  smaller  than  the  cost  of 
making  a  wrong  decision. ) 


One  more  important  concept  must  now  be  introduced  before 
we  can  discuss  optimization  of  decision,  rules.  We  have  noted  that 
the  error  prooabilities  a  and  p  are  conditional  upon  either  the 
state  of  nature  or  being  true.  In  order  to  obtain  an  over¬ 
all  probability  of  making  an  error,  and  more  important,  to  obtain 
the  average  cost  of  a  particular  decision  rule,  it  is  necessary  to 
state  the  a  priori  probabilities  of  Sq  and  Sj  being  true.  In  parti¬ 
cular  let  P  =  prob,  (S^)  ,  i-P  =  prob.  (S^  )  be  the  a  priori  probability 
P  then  is  the  (perhaps  subjective)  probability  that  the  target  is  in  the 
cell. 


In  the  discussion  that  follows  we  shall  consider,  as  in  the 
example  in  the  previous  section,  the  result  of  the  search  in  a  cell 
to  be  some  observable  x  .  The  decision  rule  then  consists  of  des¬ 
cribing  two  exhaustive  and  mutually  exclusive  regions  of  the  x~space 
Xq  and  ,  such  that,  as  before 


if  x<Xq 


x<Xj  :  Dj  . 


Again,  p.  d.f.  s  of  x  under  the  hypothesis  and  are 
assumed  to  be  known,  and  are  given  as  p^  (x)  and  p^  (x)  respectively 


2. 4. 1  Bayes  Criterion 

A  rule  that  selects  a  strategy  so  that  the  expected  cost  per 
decision  using  that  strategy  is  a  minimum,  compared  to  any  other 
rule,  is  called  a  "Bayes  "  decision  rule*.  For  a  particular  decision 


♦The  naming  of  this  criterion  for  the  Reverend  Bayes  is  somewhat 
puzzling,  but  the  term  is  too  familiar  in  the  literature  to  change  it 
here. 
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rule  represented  by  regions  and  Xj  ,  the  expected  cost  per  decision 

cost  is  seen  to  be  given  by 

C  =  <SoJ£C00{D0IS0>  4  C10{D1|S0)J  +{S1>[CU{D1|S1>  +C01{D0|Sl}] 

(2.3) 

where  for  convenience  we  let  {E}  =  prob.  {E}  ,  (E  is  any  event). 

Using  expressions  from  equation  (2.  2)  we  have 


C  =  (1  -P)  {C00  JpQ  (x)  dx  +  C1Q  JpQ  (x)  dx)  +  P  {C][  1  j1p1  (x)  dx  + 


+  C, 


(1  Jp1  (x)  dx) 


(2.4) 


This  expression  is  indeed  the  expected  cost.  In  the  first 
bracket  in  equation  (2.4)  the  first  term  represents  the  cost  of  making 
the  right  decision,  times  the  probability  that  this  decision  will  be  made, 
both  conditioned  on  being  the  state  of  nature.  To  this  is  added  the 
cost  of  making  the  wrong  decision  times  the  probability  of  making  it, 
again  conditioned  upon  .  This  sum  is  then  multiplied  by  the  proba¬ 
bility  that  Sq  will  be  the  state  of  nature.  The  second  tern*  is  a 
similar  expectation  given  is  true,  times  the  probability  that 
will  be  true. 


The  optimum  Bayes  decision  rule  is  one  that  selects  regions 
Xq  and  Xj  such  that  the  expression  C  in  equation  (2.4)  is  a  minimum. 
It  can  be  shown  (see,  for  example  reference  (19)  )  that  the  optimum 
Bayes  decision  rule  is  the  one  that  assigns  X^  and  X^  such  that 
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PL  (x) 

C10"C00 

P0(x) 

*  p 

o 

o 

s— • 

I 

o 

I—4 

M 

Pj  (x) 

C10"C00 

P0(x> 

coi-cn 

then  x  eX^  : 


then  *  e  Xq  :  . 


(2.5) 


P2  (x) 

The  quantity  L  (x)  =  — r-r  is  known  as  the  likelihood  ratio. 

PQ  (x) 

In  most  cases  of  interest  L  (x)  wil)  be  monotonic  in  x  ,  so 
that  the  decision  rule  (2.  5)  may  be  re-written 

if  x  2:  x*  :  Dj 

x  i.,  x*  :  D0 

where  x*  *s  called  the  "threshold1',  or  "bias",  and  is  the  solution  tc 
the  equation 


L(x*)  = 


Note  that  since 


1-P  C10~C00 

p  corcn 


(2.6) 


oc 

pd  =  Jplw<lx 

X * 

0p 

Pf  ■  Jp0  (*>  d* 

X* 

equation  (2.  6)  may  be  written 


(2.7) 


d<t>d) 


l-P  C10~C00 

p  co.-cn 


X=X* 


(2.8) 
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Using  equation  (2.  8)  and  the  ROC  it  is  quite  easy  to  obtain  the 
threshold,  and  operating  point  (equation  2.  7)  for  any  given  device.  Since 
the  ROC  is  just  a  plot  of  v.  s,  p^  ,  simply  find  the  point  at  which  the 
slope  is  equal  to 


This  is  illustrated  in  Figure  2.  3. 

The  average  minimum  cost,  using  the  Bayes  solution  then 

becomes 

5min  =  l1'*)  <C00  +  p£(C10-C00>  >  +  P{C01  +  pd(CirC01>  >  (2'9> 

2.4.2  Numerical  Example  -  The  Shepherd  and  the  Wolf 

To  illustrate  the  above  development,  and  to  provide  an  example 
for  comparison  of  STSD  with  what  is  to  follow,  we  shall  now  discuss  a 
hypothetical  search  situation.  An  intentional  attempt  has  been  made  to 
avoid  a  military  example,  but  the  reader  may  car  fo  interpret  the. 
target,  detection  device  and  searcher  in  the  exam.  .  y  in  whatever  way 
he  wishes. 

The  proverbial  Boy  who  Cried  Wolf  comes  rushing  to  a  shepherd 
with  the  news  that  there  is  a  wolf  hiding  near  the  pasture,  waiting  for 
the  shepherd  to  go  to  town,  The  shepherd  knows  from  past  experience 
that  the  boy  has  probability  P  of  telling  the  t*ath, 

In  order  to  avoid  the  long  walk  tc  the  pasture,  the  shepherd 
has  installed  a  microphone  there,  which  record*  the  pasture  noises 


Figure  2.  3  Uee  of  the  HOC  to  obtain  Bayes*  minimax 
and  Neyman-Pearaon  operating  points 
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(integrated  over  ten  minutes)  and  registers  the  intensity  on  a  patented 
J,Baa -meter  ".  It  is  known  that  if  a  wolf  is  present  in  the  pasture,  the 
measurement  x  of  the  meter  will  have  the  p.  d.  f. 


Px  (x)  =  ±N  (x;  1,  1) 

while  if  there  is  no  wolf  present,  the  measurement  x  will  have  the 
p.  d.  f. 


P0  (x)  =  fN(x;  i)  * 

Once  the  shepherd  observes  a  measurement,  he  must  decide 
whether  to  go  out  to  the  pasture  and  hunt  the  wolf,  or  go  to  town  as 
planned.  The  costs  of  making  the  wrong  decision  are  as  follows: 

If  the  shepherd  goes  to  town  he  makes  a  $  50  sale.  However, 
if  the  wolf  is  indeed  p  aseat,  a  sheep  worth  $  100  is  eaten,  making  a 
net  cost  of  $  50. 

If  the  shepherd  goes  to  the  pasture  and  the  wolf  is  present, 
he  kills  it  and  collect  a  $  100  bounty.  But  whether  or  not  the  wolf  is 
there  he  loses  the  chance  to  n.'‘kc  the  $  50  sale  in  town.  Thus  the 
cost  matrix  is 


[C..]  * 
ij 


1 

Ul 

o 

50 

50 

-50 

Since  this  cost  matrix  may  be  additively  normalised,  we  could  also 
write  the  equivalent  matrix  (in  units  of  $  100): 


[Cy]  = 


0  1 
1  0 


-  .5 


where  the  matrix  term  ia  the  loss  due  explicitly  to  decision  errors. 
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By  utilizing  this  information  and  substituting  into  equation 
{2.  6)  the  shepherd  finds  that  his  optimal  threshold  is  the  solution  to 


L  (x*)  =  exp  (x* 


1-P 

P 


x*  =  j  +  In 


1-P 

P~ 


1 

If  the  boy  has  a  P=~  of  being  right,  then  if  the  Baa -meter  reads 
1  2 

x  >  —  ,  the  shepherd  goes  to  the  pasture,  and  he  goes  to  town  if 

4 

x  <  ~  .  The  resulting  detection  and  false -alarm  probabilities  are 
4 


Pd  =  1  -  erf  (-  ^-)  =  .  69 
pf  =  1  -  erf  (“)  =  .  31 


and  the  average  cost  to  the  shepherd  is  -  $  19  {a  profit  of  $  19). 


2.4.  3  Other  Criteria:  Minimax  and  Neyman-Pearson 

It  sometimes  happens  that  the  decision  maker  thinks  he 
cannot  estimate  the  prior  probabilities  1  -P  and  P  of  the  two 
states  of  nature  and  ,  or  that  he  cannot  supply  a  cost 
matrix  (C^)  ,  or  both.  We  shall  quickly  mention  here  methods 
that  have  been  used  to  overcome  these  difficulties.  However,  it  is 
to  be  kept  in  mind  that  all  of  these  methods  lead  to  accepting  some 
operating  point  for  the  system,  and  from  the  discussion  in  the  previous 
section  we  see  that  thi3  point,  corresponding  to  a  Bayes  solution, 
implies  some  specific  relations  about  costs  and  prior  probabilities. 

When  the  prior  probability  P  is  assumed  to  be  unknown 
the  'minimax"  criterion  calls  for  operating  such  that  no  matter  what 
P  is,  the  maximum  possible  loss  is  minimized.  This  argument  is 


usually  justified  by  considering  "nature",  or  the  target,  to  bs  in  a  two- 
zero-sum  game  against  the  searcher,  so  that  it  tries  to  pick  the  worst 
possible  P  .  A  very  good  treatment  of  this  approach  is  presented  in 
Blackwell  and  Gir schick  (7).  Because  of  the  game  characteristics,  the 
solution  may  be  found  by  finding  the  maximum  of  the  minimum  cost  to 
the  searcher  obtained  by  using  a  Bayes  !  Rule  as  if  the  P  was  known, 
and  then  operating  as  if  the  P  producing  this  maximum  cost  was  in 
fact  the  real  prior  probability.  The  proper  operating  point  may  be 
obtained  by  noting  that  maximizing  equation  (2.9)  with  respect  to  P 
yields  the  expression 

C00  +  C10Pf  =  C01  *1_pd*  +  C11  Pd 

which  is  a  straight  line  in  the  p^-p^  plane.  The  intersection  of  this 
line  with  the  ROC  gives  the  minimax  operating  point  (see  Figure  2.3). 

TYhen  the  searcher  is  unwilling  to  supply  both  the  prior 
probabilities  and  a  cost  matrix,  then  the  Neyman- Pear  son  criterion 
is  often  used.  The  searcher  selects  rjome  arbitrary  value  of  false 
alarm  probability,  p^*  ,  which  must  never  be  exceeded.  The  solution 
is  to  maximise  p^  such  that  p^  sS  p^*  .  For  most  simple  cases,  when 
the  ROC  is  monotonic  as  in  Figure  2. 1,  this  is  achieved  by  operating 
such  that  Pj  =  p^*  ,  and  the  threshold  x*  the  solution  to 

00 

Jp0  (x)  dx  =  pf* 

x* 

Other  criteria  are  mentioned  in  the  literature  and  include 
minimizing  p^  for  a  fixed  p^  (a  sort  of  inverse  Neyman-Pearson 
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procedure);  assuming  that  c01=Cj0  ,  ’  and  t**1® 

‘ideal  Observer'^,  and  so  on. 

In  most  of  the  work  of  this  paper  it  shall  be  assumed  that 
prior  probabilities  are  available,  and  that  specific  cost  models  are 
applicable,  so  that  the  criteria  mentioned  above  -will  not  be  useful 
except  for  possible  comparative  purposes  It  is  important  to  note, 
however,  that  it  can.  be  shown  that  all  of  the  above  criteria  produce 
a  threshold  decision  rule.  That  is,  the  rules  all  require  the  calcu¬ 
lation  of  a  likelihood  ratio,  and  comparison  Of  it  to  some  fixed 
value. 


2.5  Sequential  Hypothesis  Testing  -  The  sprt 


The  decision  structures  in  the  previous  sections  were 

all  derived  on  the  basis  that  the  observable  could  be  a  vector 

x  =  (x,  ,x_, .  . .  ,x  )  ,  so  that  the  threshold  x*  could  be  a  n-1 
—  i  c  n  — 

dimensional  figure  in  n-space  represented  by  the  solution  to 


pl  (**)  *xf  »•••»**) 

pQ  <**>  *  p0<xf  •*!’••"*;> =  A 

.  .  ,  ,  .  l-pf  C10"C00| 

where  A  is  some  fixed  value,  such  as  -=-  I-— - — —  j  as  xn 

P  \C01‘C11  / 

section  (2. 4).  The  components  of  x  may  be  n  values  of  some 


measurement  taken  at  successive  times,  Ths  analysis  in  the 


previous  sections  holds  providing  that  n  is  fixed  in  advance. 


When  the  cost  of  taking  each  measurement  xs  considered 
however,  it  may  be  desirable  not  to  fix  in  advance  the  number  of 
measurements  to  b«  taken,  but  to  allow  for  the  number  to  be  determined 
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on.  the  basis  of  what  has  been  observed  up  to  that  time.  This  is  called 
a  sequential  test,  and  the  related  theory  has  been  developed  primarily 
by  Wald  (47,  48).  In  the  consideration  of  search  problems,  the  theory 
has  been  specifically  applied  to  the  sequential  testing  of  hypothesis, 
these  usually  being  the  two  aleternatives  and  .  Again  it  is 
assumed  that  the  target  is  stationary  throughout  the  search. 

The  Wald  sequential  probability  ratio  test  (sprt)  involves 
not  only  the  two  possible  decisions  (D^  and  D^)  considered  before, 
but  also  includes  a  third  possible  decision  W  :  wait  for  at  least  one 
more  observation. 

On  the  basis  of  observing  Xj  for  the  first  measurement, 
the  likelihood  (x^)  =  p^  (Xj)/Pq(Xj)  is  computed  and  compared  to 
two  quantities  A  and  B  (A  >  B)  .  The  decision  rule  is 

if  Lj  (x^)  >  A  : 

L1(x1)<B  ::  Dq  (2.10) 

BSLj  (x:)  ^  A  :  W  . 

The  Dq  and  Dj  decisions  are  the  terminal  decisions,  and  they 
stop  the  process  with  a  definite  commitment  of  some  sort  of  action. 

If  the  W  decision  is  made,  then  a  new  measurement  is  taken,  and 
a  new  likelihood  ratio  is  computed.  If  the  process  has  gone  on  for 
k  observations,  then  the  likelihood  to  be  used  in  the  next  comparison 
is 

pl  (xj ,  x2, . . . ,  xk) 

Lk(xrx2’,,,,xk>  =  p0(x1,x2t...>xk) 
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with  the  decision  rule  the  same  as  in  (2.  J^),  but  with  replaced 

by  V 

The  crux  of  the  profctem  is  calculating  the  values  of  A 
and  B  so  that  a  given  criterion  is  satisfied. 

■With  the  exception  of  Blackwell  and  Girshick  (7),  the 
accepted  procedure  has  been  to  use  a  criterion  based  upon  conditional 
error  probabilities  (o  and  |3)  only.  That  is,  A  and  B  are  selected 
so  that  when  Sq  is  the  state  of  nature  the  probability  of  reaching 
terminal  decision  vs  ,  while  if  S,  ,  the  probability  of  ter¬ 
minating  with  decision  is  1-p^  •  Obtaining  these  required  values 
of  prob.  {Bj  |Sq  }  ,  and  prob.  {D0|S,}  under  the  sprt  is  in  genera1 
a  very  difficult  problem.  Wald  has  shown,  however,  that  the  relatione 
between  A  and  B  and  p^  and  p^  are  governed  by  the  inequalities 


(2.  1. 


In  order  to  understand  the  test  better,  let  us  examine  the 
cate  where  the  observations  are  independent.  Then  we  can  write 


. ^  -  A  p‘"v  ■  l-M- 

we  take  logarithms  of  the  likelihood  function  as  well  as  the  boundariof 


zk  =  toLk(ltrx2 . V-I'j 

j=i 
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a  =  in  A 


b  =  in  B 


where 

z-  =  ^[px  (x  )/p  (x  )]  (2.12) 

J  J  J 

Then  the  decision  rule  (2.  10)  becomes 

if  2  >a  :  D. 
k  1 

zk<b  :  Do 

b  <  Z,  <  a  :  W 
k 

where  is  the  sum  of  k  identically  distributed  random  variables 

z.  .  The  density  function  of  z.  is  %q(')  or  g^(’)  as  the  state  of 
J  J 

nature  is  SQ  or  ,  where  g^(z)dz  =  p.  (x)  dx  and  x  and  z  are 
related  as  in  (2.  12). 

Thus  we  can  view  the  process  as  a  random  walk  of  a  g.  (• )  - 
distributed  random  variable,  with  absorbing  barriers  at  a  and  b  .  A 
further  discussion  of  this  problem  is  taken  up  in  appendix  A  . 

It  can  be  shown  that  for  a  large  class  of  p.  d.  f.  s.  on  z  the 
process  will  eventually  terminate.  That  is 

prob.  {b  <  <  a  j  for  all  k£n)  -*0  as  n-*  *>  . 

The  advantages  of  using  the  sprt  is  in  the  proven  result 
(see  references  (47,  48)  )  that  the  average  sample  number  (ASN) 
needed  to  obtain  a  given  set  of  a  and  (3  error  probabilities  (i.e. 
a  given  p^  and  p^)  is  always  less  than  the  fixed  sample  length  (say  N) 
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required  to  achieve  the  same  error  probabilities,  no  matter  what  the 
state  of  nature.  {Since  the  number  of  samples  in  the  sprt  is  a  random 
variable,  however,  it  is  to  be  noted  that  it  will  occasionally  be  greater 
than  N. ) 

It  can  be  shown  that  if  the  z.  are  in  some  sense  small  in 

3 

comparison  with  the  distance  (a-b)  ,  the  inequality  signs  in  (2.11) 
may  be  replaced  by  equalities.  With  this  approximation  it  is  also 
possible  to  derive  expressions  for  the  ASN  under  both  states  of 
nature  (see  Wald  (47)  for  the  derivation)  : 


ASN0  =  [a  -  (l-pf)(a-b)]/EQ(z) 
ASNj  =  [a  -  (1  ”P<j)(a-b)]  / Ej  (*) 


where 

00 

E^tx)  *  ,^ln[  pi  (xJ/PqUc)]  p^txjdx  ;  k  =  0,  1  (2.1 

-oo 

is  the  expected  value  of  the  random  variable  z  under  .  When 
Kq(s)  or  E^  (s)  are  sero,  these  equations  may  be  replaced  by  ones 
involving  the  variances. 


2. 5. 1  Numerical  Example 

We  return  to  the  shepherd  of  section  2.4. 2,  who  has  now 

decided  to  apply  the  sprt,  and  has  arbitrarily  declared  that  the  test 

must  have  p .  2t  .  69  and  p,  .5 . 31  .  The  random  variable  z  of 
a  z 

equation  (2. 12)  becomes 
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z  =  in  [  L  (x)  ]  =  x  -  J 

so  that 

B0{z>  =  V2J  "I  •  11 

gl(2)  =  £NU;  1)  . 

In  addition,  the  assumption  is  made  that  the  inequalities  of 
equation  (2.  11)  are  equalities,  so  that  A  1  ®  =  ~-|^  anc*  t^lus 

a  =  .  8  ?  b  =  - .  8 . 

The  resulting  test  is  to  compare  z  =  x  -  y  with  a  and 

M 

b  ,  so  that 

if  3  <  x  <  1 . 3  :  take  another  measurement 
x  >  1 . 3  :  go  to  the  pasture 
x  <  -.  3  :  go  to  town  . 

If  another  measurement  is  taken,  then  z^+z2  =  x^  -  ~  -fx^-  — 
is  compared  to  .  8  and  8  ,  and  so  on. 

Since  Eg  (z)  =  -  j  and  (z)  =  j  the  approximate  formulae 
(2.  13)  give 

ASNQ  =  -[.8  -  .69  0*6)]  2  =  .6 
ASNj  =  [.8  -  .31(1.6)]  2  =  .6 

These  results  are  puzzlirg  to  the  shepherd,  as  he  sees  that 
the  average  number  of  times  he  will  have  to  take  measurements  is  less 
than  one  J  However  he  notices  that  the  standard  deviation  of  the  random 
variable  y  is  1  and  since  (a-b)  =  1.6  ,  the  requirement  that  z  be 
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email  compared  to  (a-b)  is  not  fulfilled,  eo  he  does  not  expect  equatioj 
(2. 13)  to  hold.  The  exact  solution,  as  shown  in  Appendix  A,  is  unavaili 

What  should  he  do  now?  In  fact,  the  choice  of  arbitrary  p.  , 

d 

p^  was  really  forced  upon  the  shepherd  by  the  constraints  of  the  spjrt. 
Being  a  practical  person,  he  still  really  desires  to  minimize  his  costs. 
Do  the  measurements  cost  him  anything?  After  all,  if  they  don't,  he 
would  be  satisfied  with  measuring  all  night  and  eventually  (because  of 
the  central  limit  theorem)  he  would  be  certain  about  the  state  of  nature 
Apparently,  a  new  model  and  solution  must  be  used. 

2,  6  Need  for  extension  of  the  models 

In  section  2.  2  it  was  pointed  out  that  the  defect  of  previous 
approaches  to  what  has  been  conventionally  called  the  ’'search"  problei- 
has  been  in  the  neglect  of  the  possibility  that  the  detection  and  sensing 
devices  used  could  produce  spurious  signals,  and  hence  false  alarms. 
These  theories,  however,  were  certainly  thorough  in  the  applications 
of  cost  and  utility  models,  as  well  as  prior  probabilities. 

The  non- sequential  work  in  STSD  includes  false  alarms, 
a  cost  structure,  and  cognisance  of  prior  information,  but  this  is 
limited  to  the  simple  hypothesis  test.  As  we  have  seen,  this  contains 
only  a  rudimentary  element  of  search,  in  that  the  decisions  are  res¬ 
tricted  to  terminal  enss  after  a  single  observation,  There  is  no 
structure,  for  example,  that  allows  one  to  stop  and  decide  DQ  or 
Dj  on  the  basis  of  no  observations  at  all. 

On  the  other  hand,  with  the  introduction  of  sprt  the  cost 
structure  and  prior  information  are  lost,  the  decision  maker  is  left 
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with  the  task  of  assigning  errors  of  the  first  and  second  kind,  and  there 
is  no  consistent  way  to  take  into  account  the  experimental  cost.  In 
addition,  the  sprt  must  be  analysed  in  terms  of  the  inequality  (2.  11), 
which  becomes  an  equality  only  under  conditions  that  are  equivalent 
to  requiring  that  «  1  ,  1-p^  *  •  Although  this  condition  might 

seem  desirable,  we  have  seen  in  the  numerical  example  above  that 
minimum  cost  solutions  do  not  particularly  satisfy  it. 

Although  some  pioneer  work  by  Blackwell  and  Girshick  (7) 
has  been  done  some  time  ago  in  an  attempt  to  i  '  "T^ayes  ''  structure 
on  sequential  hypothesis  testing,  the  applications  were  not  geared 
towards  the  problems  of  detection.  The  fallowing  chapters  will 
consider  a  minimum  cost  analysis  of  sequential  detection,  under 
some  conditions  that  are  perhaps  more  reasonable  than  those  proposed 
before,  The  technique  of  ar.^ysis,  that  of  using  dynamic  programming, 
will  then  lead  to  the  consideration  of  yet  another  problem,  one  that 
could  not  otherwise  be  attacked  by  the  sprt  type  of  approach. 

Before  going  on  to  such  analysis,  it  is  of  interest  to  mention 
here  some  work  that  has  been  published  in  recent  years  on  the  subject 
of  sequential  detection  theory.  To  repeat  fhe  point  made  above,  none 
of  this  work  takes  into  account  the  overall  at  of  operation,  and  thus 
the  results  are  interpreted  in  terms  of  a  and  p  errors  as  well  as 
expected  test  lengths. 

In  many  realistic  search  problems,  the  target  characteristics 
are  not  exactly  known,  and  so  some  of  the  parameters  in  (x)  are 
themselves  random  variables.  The  searcher  not  only  must  decide 
upon  the  presence  or  absence  of  such  a  target,  but  an  estimate  of  the 
unknown  parameters  is  also  required.  Such  estimation  problems. 
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(sometimes  referred  tc  as  "clas s if1  cation “),  extremely  difficult  to 
analyse  in.  a  sequential  manner,  have  been,  treated  by  Selin  (41)  and 
Turner  (44). 

Some  work  has  been  done  in  an  effort  to  obtain  exact  solu¬ 
tions  to  the  random  walk  of  the  sprt  as  discussed  in  section  (2.5)  wher. 
the  p. d.f.  s  involved  are  peculiar  to  those  found  in  practical  detection 
devices.  Reed  (39)  derives  some  theoretical  results,  and  Marcum 
and  Swerling  (31),  for  example,  produce  Monte-Carlo  simulation® 
for  practical  cases. 

Some  arbitrary  "many  threshold1'  decision  rules  have  been 
analysed.  An  example  is  the  rule  presented  by  Kennedy  (24)  which 
starts  a  sequential  test  only  after  an  initial  signal  exceeds  some 
fixed  threshold,  this  latter  threshold  set  to  limit  the  number  of  times 
the  sequential  test  is  applied. 

Bussgang  (8,  9)  and  Middleton  (34),  among  others,  have 
treated  sequential  detection  under  the  most  general  noise  and  signal 
statistics c  Helstrom  (20)  and  Preston  (38)  have  compared  the  sprt 
to  ths  fixed  length  test  for  practical  examples  of  search  radars. 

The  particularly  interesting  problem  of  sequentially  testing 
a  continuous  glgnal  (the  experiments  are  not  done  at  discrete  times) 
has  been  treated  by  Selin  (41).  In  this  work  he  also  indicates  the 
optimal  sequential  test  when  the  noise  is  correlated  in  time.  Gray 
(17)  has  also  contributed' to  this  problem. 
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CHAPTER  III 


STATIONARY  TARGET 


This  chapter  describes  a  more  general  problem  than  those  present 
earlier,  in  that  it  provides  a  combination  of  the  Distribution  of  Effort  and 
STSD  appioaches.  The  mode  of  attack  and  the  mathematical  technique  in¬ 
volved  reflect  the  sequential  nature  of  the  solution  as  well  as  the  minimum 
expected  cost  aspects  of  decision  theory.  The  use  of  stochastic  dynamic 
programming  in  this  regard  has  been  generally  indicated  by  Bellman  et  al 
(4),  as  well  as  Blackwell  and  Girschick  (7).  Goode  (16)  has  applied  the 
principles  involved,  but  with  a  model  not  particularly  suited  to  search. 

The  strategy  developed  in  this  chapter  will  turn  out  to  be  a 
modified  form  of  the  sprt,  with  the  coat  factors  and  prior  probabilities 
appearing  as  intrinsic  parameters.  In  addition,  the  resultant  minimum 
cost  arises  as  a  natural  consequence  of  the  calculations.  This  makes 
it  possible  to  compare,  in  an  efficient  way,  the  cost  of  the  optimal  strategy 
with  costs  of  certain  non-optimal  strategies  that  will  be  considered  for 
practical  reasons. 

3. 1  Problem  Statement 

1.  The  target  is  either  present  (S^)  or  not  present  (Sq)  in  the 
cell  of  interest,  with  prior  probabilities  P  and  1  -P  res¬ 
pectively,  and  remains  so  for  the  entire  search  (the  target 
is  stationary  ). 

2.  If  the  cell  is  observed,  the  result  is  a  random  variable  x 
which  has  p.d.f.  (x)  or  p^  (x)  as  the  state  of  nature 
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is  Sq  or  .  Observations  take  place  at  unit  time  interval 

3.  After  every  observation  (including  the  zeroth)  the  searcher 
makes  one  of  the  following  decisions: 


=  Decide  target  is  present 
=  Decide  target  is  not  present 
=  Wait  for  another  observation. 


4.  The  decisions  D-  and  D,  are  terminal  decisions,  and 

O  1 

end  the  process.  The  costs  to  the  searcher  making  decision 
D.  given  S.  is  true  are  C..(i,  j  =  0,1),  with  C. ,  =  C__  -  0 

i  j  ij  11  00 

5.  The  decision  W  continues  the  process  at  least  one  more  tin~ 
unit.  The  cost  of  this  delay  depends  upon  the  state  of  nature, 
and  is  W.  if  S.  is  true  (i  =  0,  1)  . 

6.  The  objective  of  the  searcher  is  to  minimize  the  expected 
cost  of  a  search.  The  strategy  (that  is,  the  rule  for  making 
decisions  given  a  sequence  of  observations)  that  achieve** 
this  minimum  expected  cost  is  the  "optimal"  strategy. 

The  fact  that  the  cost  of  experimentation  depends  upon  the  state  of  natur 
is  what  makes  this  model  particularly  applicable  to  search  problems. 
Tor  example,  consider  the  ease  of  active  sonar  search  against  a  sub¬ 
marine  that  is  suspected  of  being  in  a  missile -firing  position.  Clearly, 
the  searcher  wojld  like  to  make  a  decision  as  soon  as  possible  if  the 
submarine  is  indeed  present,  whereas  if  the  indications  are  that  the 
submarine  is  not  present,  the  searcher  could  afford  to  spend  more 
time  making  sure.  Thus  in  many  search  situations  we  should  expect 

to  have  W,  >  W„  . 

1  0 


♦This  assumption  is  the  equivalent  of  zero -normalizing  the  cost  for 
any  given  P  ,  and  is  made  to  simplify  the  algebra. 


We  shall  now  write  a  functional  equation  from  which  the  optimal 
strategy  car  be  obtained,  the  equation  being  a  straight-forward  application 
of  Bellman's  Principle  of  Optimality  (3). 

Let  C  (P)  be  the  minimum  cost  of  search  obtained  by  following 
the  optimal  strategy,  where  P  is  the  a  priori  probability  that  the  target 
is  present.  This  minimum  cost  will  be  the  smaller  of  the  costs  resulting 
from  the  three  possible  decisions  that  can  be  made  at  that  time: 


If  the  decisions  are  or  ,  the  costs  are  due  to  terminal 

wrong  decisions:  PC^  and  (I-P)C.q  respectively. 

If  the  decision  is  W  ,  the  cost  is  PWj  +  (1-P)  Wq  plus  the 
cost  of  continuing  from  then  on,  having  observed  some  value  of  x  . 
bability  of. observing  a  valup  bejkvtesn.  x  arid  x+dx(i)5  Pp^x)  d£+^lrl?)  pQ-(x)  •. 
Having  observed  the  value  x  ,  however,  the  probability  now  of  the 
target  being  present  is 

{xISjHSj}  pj  (x)p 

{si,x}=  {xjSj}  {Sj}  +  (x[S0)  {  S0>  =pl(x)P+  P0(x){l-P)  • 


The  equation  for  C  (P)  may  thus  be  written 

t 


C(P)  «  min\ 


0-P)C 


10 


'1 


PC 


01 


X  D„ 


[p  Wj+(1  -P)W0+ j[P  Pl(x)+(1  -P)p0(x)J  C 


■  so 


Pj  (*)  p 

PjWPs^llrtll.P) 


<*#  :  VT 


For  rtotational  convenience  let  us  define  the  minimum  cost 
of  terminal  decision  to  be  T  (P) 
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T  (P)  =  min 


U-P)Cio  :  Dj 


and  the  unconditional  p.  d.f.  of  receiving  a  value  x  to  be  g  (x) 

g  (x)  =  P  px  (x)  +  (1  -P)  pQ  (x) 

so  that  equation  (3. 1)  may  be  re-written 

r 

T{P) 

C  (P)  -  min  |  (■  x)p-|  0.2) 

PW1  +  !1-P)W0+  ]  gWcj-j^-Jdx. 

-00 

3.  2  Arbitrary  Truncation  of  the  Search 

In  order  to  find  a  way  to  solve  equation  (3.  2)  we  shall 
consider  the  following  arbitrary  truncation  of  the  search:  At  the 
start  of  the  process,  the  searcher  is  told  that  he  has  only  n  available 
possible  observations  (or  equivalently,  time  units)  left.  If  the  decision 
W  is  made,  at  the  next  decision  there  will  be  only  n-1  possible  obser¬ 
vations  left,  and  so  on.  If  n*0  ,  then  one  cf  the  terminal  decisions 
must  be  made. 

If  we  define  C  (P)  to  be  the  minimum  cost  of  search  given 
n 

that  there  are  n  available  observations  remaining  before  a  terminal 
decision  must  be  made,  we  may  write  the  equivalent  of  equation  (3.  2)  as 
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The  boundary  condition  on  this  equation  is  given  when 
n=0  ,  for  then  a  terminal  decision  is  required  and  so 


CQ  (P)  =  T  (P)  =  min 


d-P)C 


10 


PC 


01 


(1-P)C10  P£:P*  :  Dj 


vPC01 


P=3P*  :  D 


0 

(3.4) 


where 


P*  = 


'10 


C10  +  C 


01 


It  is  of  interest  to  go  through  the  calculation  of  (P)  in 
some  detail,  since  the  behavior  of  the  terms  affords  an  insight  to 
the  decision  structure  of  the  solution,  and  to  the  form  of  the  minimum 
cost  as  a  function  of  P  . 


Pj  (x)  P~ 
_  g(x)  J 


In  order  to  calculate  Cj  (P)  we  must  first  evaluate 


This  latter  expression  is  the  minimum  cost  given 


that  one  observation  x  has  been  taken  and  that  a  terminal  decision 
must  be  made.  From  equation  (3.4),  we  find,  after  some  manipulation, 
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(l-P)p0(x)Cio  pl^,^l~p  p* 

iU)  ”  ~F~  up*  :  Di 

PPjCx)C01  Pl^x)_l-P  p* 

g(x)  P0(x)  “  P  l-F*  1  D0 

(3.5) 

1-P  P*  . 

If  we  let  — p-  p*  =  *  an<*  recall  that  Pj  (x)/p0(x)  =  1 

the  likelihood  ratio,  we  see  that  the  rule  presented  by  equation  (3.5) 
is  identical  with  equation  (2.5).  (We  See  now  that  this  corresponds 
to  the  solution  for  the  truncated  search,  and  given  that  an  obse.  /ation 
was  taken:  two  restrictions  that  are  not  contained  in  the  general 
problem  statement. )  In  particular,  by  letting  x*  again  be  the  solution 
to 


L(x*)  =  A(P) 

and  putting  the  results  of  equation  (3.5)  into  equation  (3.  3)  (with  n~  1 ) , 
'we  obtain 


The  relation  between  this  search  and  the  one  described 
in  section  2. 4  can  now  be  further  explored.  H  we  let  Wj  -  Wq  -  0  , 
(and  recall  that  we  have  let  Cqq  =  C^j  =  0)  and  neglect  th«  possibility 
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A 


of  a  terminal  decision  before  taking  any  observations  (the  T  (P)  term) 
then  equation  (3.  6)  is  identical  to  equation  (2.9).  If  we  include  these 
factors  however,  as  required  by  the  general  problem,  the  minimum 
costs  and  strategies  are  quite  different. 

The  decision  structure  presented  by  equation  (3.6)  may 
be  easily  visualized  by  drawing  a  sketch  of  the  terms  in  the  right 
hand  side,  as  in  Figure  3.  1.  T  (P)  <  seen  to  be  two  straight  lines 

meeting  at  P=P*  .  Let  us  define  L  <P)  to  be  the  bottom  of  the  right 
hand  side  of  equation  (3.  3).  Then  (P),  the  lower  part  of  equation 
(3.  6)  can  be  seen  to  have  the  properties  as  shown  in  Figure  3.  1. 

G1  <0)  “  wo 

Gl(1)  =  wl  • 

The  interception  of  T  (P)  and  (P)  occurs  at  two 
points  (if  at  all),  these  two  points  bsing  at  P  =  and  P  =  Sj 

<n-V- 

From  the  figure  we  can  see  that  T  (P)  <  G,  (P)  in  the 
two  regions  0^P<Yj  and  ^<p<i  ,  so  that 

(1-P)C10  fi^Sl  :  D, 

Cl<P)=\PC0i  ^0^P*rl:D0  ^ 

(pWj+d-PjWQ+PC^  Jpi(x)dx+(l-P)C10  Jp0(x)dx  ■yl<P^6l 

-00  x* 

(3.7) 

where  6^  is  the  solution  to 

O-SjJCjq  =  G^fij)  (3.8) 
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and  is  the  solution  to 

(1-VciobCW-  (3'9) 

Since  x*  is  a  function  of  P  ,  and  the  integrals  in  (•) 
are  usually  not  expressable  in  closed  form,  the  solutions  of  equations 
(3.7)  and  (3.8)  must  be  obtained  numerically. 


3.3  C  (P)  as  the  Limit  of  the  Truncated  Search 

Now  that  (P)  has  been  obtained,  we  can  proceed  to 
iteratively  solve  equation  (3.  3)  by  letting  n  =  2,  3,  .  .  .  and  so  on. 
These  iterations  will  also  produce  the  decision  regions  bounded  by 
the  interception  of  T  (P)  with  (P),  G^  (P),  •  •  •  at  points  7^,  5^ 
73» 53  »  etc. 

In  particular,  we  may  re-write  equation  (3.  3) 


cn(P)  *{ 


(l-P)C 


10 


PC 


01 


6  :£P=£1 
n 

O^P^T 
.00 


I  r  I  piWp 

{  P  Wj+(1  -P)  W0+Jg  (x)  cJ  — 


g(x) 


I  dx  7 

n.  n 


.00 


with  the  boundary  condition 


where 


c0(p)  = 


(l-P)C 


10 


VPC01 


P*=SP==1 


OSPSP* 
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If  we  postulate  that  in  the  limit  as  C  (P)  approaches 

n 

some  function  of  P  independent  of  n  ,  then  by  letting  n-*-«e  in  both 
sides  of  equation  (3*3),  we  see  that  this  function  is  defined  by  equation 
(3.2)  to  be  C  (P)  .  Thus,  if  in  fact  this  limit  exists,  C  (P)  can  be 
calculated  to  whatever  degree  of  accuracy  that  is  desired  by  successive 
iterations  of  equation  (3.3). 

What  is  more  important,  from  the  point  of  view  of  the  strategy 

that  is  associated  with  C  (P)  ,  is  the  fact  that  if  C  (P)  can  be  obtained 

to  any  degree  desired,  then  the  points  y  =  lim  y  and  5  =  lim  6 
*  n  5*  n  n 

defining  the  decision  regions  can  also  be  obtained  as  closely  as  desired. 

It  remains,  then,  to  prove  that  successive  iterations  of 

equation  (3.  3)  will  produce  a  C^(P)  that  will  converge  to  some  C(P)  , 

and  that  the  decision  points  y  and  5  dc  exist  as  limits  of  y  and 

n 

5  ,  and  in  a  non -degenerate  way  (that  is,  y  >  0  ,  6  <  1). 

n 

3. 3. 1  Convergence  Proofs  for  C  (P)  ,  y  and  6 

n  n  n 

We  consider  the  defining  equation  of  C  (P) 

13. 

C  (P)  *  min  (T(P),  G  (P)J  02JP<1 
n  n 

where 

T(P)  *  mlnCd -P)C1Q  ,  PC01] 

O  90  '  P,  (x)  P 

V*1  *  +  » -p>wo  ♦  -J5J-  to  . 

-«  *" 

and  the  points  y  and  6  are  defined  by 


\coi 1 W 


U-4  )c.„  =  G  (4  )  . 
n  10  n  n 
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The  costs  Cqj  ,  C^q  ,  and  Wq  are  all  non -negative,  and  g(*) 
and  p^(*)  are  p.  d.  f.  s. 

Theorem  A:  C  (P)  5:  0  for  all  n=tO 
- -  n 

(This  theorem,  provides  a  lower  bound  for  the  iteration 
process. ) 

Proof 

1.  Since  S  0  ,  then  T  (P)  ^0  , 

2.  (P)  =  T  (P)  by  definition  so  that  (P)  ^  0, 

3.  .If  (P)  S:  0  for  all  P  ,  then  any  average  of  C^(P)  over 
P  is  >0, 

4.  (P)  is  the  sum  of  PWj  +  (l-P)Wg  plus  an  average 

of  C  (• )  ,  all  >  0  ,  so  that  G  ^  (P)  0  . 

n  nrl 

5.  Finally  Cn+1  =  min  [T(P)  ,  Gn+1  (P)]  5:0  by  (1)  and  (4), 
and  the  induction  is  complete  starting  with  the  result  of  (2). 

Theorem  B:  C  . ,  (P)  <  C  (P> 

-  n+1  n 

(This  theorem,  coupled  with  Theorem  A,  establishes  the 
absolute  convergence  of  (P)  to  some  limit  function 
(defined  as  C  (P)  ). ) 


Proof 

1 .  Cq  (P)  =  T  (P)  by  definition, 

2.  Cj  (P)  =  min  [T  (P)]  =  min  [CQ  (P)  ,  Gj  (P)J  £  CQ  (P)  , 

3.  Suppose  Cn+1  (P)  -  Gn(P)  ^  0  for  some  n  , 
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4-  °a+2(P| 


p®°  rpiwpl  Tp,  (x) 

(p>  *  -  j*  w  cn 

„  fm  1—  °  4 


=  Jg  w 


Pj  (x)P 


n+1  g(x) 


Px  (x)  P 

n  g(x) 


by  (3)  and  the  fact  that  g  (x)  >  0  for  all  x  .  So  we 
conclude  that  Gn+2  (P)  <  G  (P), 

5.  Finally,  C^^iP)  =  min  [T  (P)  ,  G^J  S  min  [T  (P)  ,  (P, ; 

Cn+1  ^  ky  (4),  and  the  induction  is  complete  starting  with  (a 

Theorem.  C:  a)  T  <  y 

-  n+j  n 

b)  6  Sr  6 

n+1  n 

(This  theorem,  in  conjunction  with  Theorem  D,  establishes 

the  convergence  of  the  decision  points  y  and  6  to  their 

n  n 

respective  limits  7  and  6  . ) 


Proof 

(The  proof  is  given  for  part  b),  the  proof  of  a)  being  essentially 

the  same) 

*'  °n*4n+l*  S  Gn*l  (4n+l*  hy  ^  o{  theorem  B  . 

2’  C10(I'4n+lJ  S  °n+l  *6n+l*  by  definition . 

3.  Gn(P)  <  (1-P)C10  for  P  <  6n  by  the  defining  equatton  . 

4.  To  prove  by  contradiction,  we  suppose  6  >6 

n  n+1  . 

5a  By  (3),  (2)  and  (1)  we  have  the  contradiction 

■  °n+l<W  s°n<5„+1>  . 
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Theorem  D :  If  >  0  ,  WQ  >  0  then 

a)  y  ^  r  >  0 

n 

b)  6  2£  A  <  1  for  all  n 

n 

where  F  and  A  are  explicit  function*  of  the  cost  terms. 

(This  theorem  shows  that  the  decision  regions  described  by 
the  decision  points  7  and  6^  are  bounded  by  terms  -greater 
than  0  and  less  than  1  respectively,  and  so  that  for  sufficiently 
large  or  small  P  a  terminal  decision  will  always  be  appropriate 
Since  the  theorem  holds  for  all  n  ,  it  holds  for  7  =  lim  y 

n 

and  6  =  Hm  6  . ) 

n-*-oo  n 
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Theorem  E:  C  (P)  >  0  for  at  least  some  P 
-  n 

(This  theorem  is  immediately  proven  by  noting  that 

C  (P)  =  T  C_.  >  0  for  all  n  .  The  result  provides 
n  01 

the  non-degeneracy  of  (P)  ,  and  in  particular  the 
non-degeneracy  of  C  (P)  . ) 


3.  4  Implementation  of  the  Optimal  Search 

Before  going  on  to  a  specific  example  of  the  calculation  of 
C  (P)  and  the  associated  strategy,  it  is  instructive  to  consider  the 
implementation  of  the  resulting  strategy.  We  assume,  then,  that 
C  (P)  ,  y  and  6  have  been  obtained.  The  search  process  proceeds 
as  follows. 

1.  If  0rSP<7  or  $<P:£l  ,  make  the  appropriate  terminal 
decision.  The  search  is  c/er,  and  the  expected  cost  is  T  (P)  . 


2.  If  7^P2£6,  then  take  an  observation  x1  .  Calculate  the 
a  posteriori  probability  that  the  target  is  present: 


r  i  ■»  Pi  (*i)p  PiUJP 

S*  '  *  gtxj)  *P1'(x1)F+P0(x1)(l-P)  * 


Compare  this  new  probability  with  y  and  6  •  This  is  the  equivalent 

of  comparing  the  likelihood  ratio  L(x^)  =  p?  (xj)/pq(Xj)  with  the 

values  and  r~  with  the  result 

P  1-7  P  1-6 

if  L(x. )  <  :  D 

1  P  1  -  7  0 

L<*i>>¥nr 


1-P  _y_ 

"  p  1 


6 

1-6 


W 
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3.  If  L(Xy)  lie*  in  the  W  region,  take  another  observation  x^  , 
and  compare 


,  pl<xl,pl<x2> 

L(xl'x2)-p„(x,)p„<x,) 


with  the  values 


1-P 


0 

y 


P  1  -y 


‘1 
and 


0  '  2' 

1  -P  6 
P  1-6 


and  so  on. 


We  see,  then,  that  we  have  generated  a  Wald  sprt  equivalent 
to  the  one  described  by  equation  (2.  10),  but  with  the  decision  points 
A  and  B  given  by 


A  = 


1-P 

P 


6 

1-6 


R -hi  JL. 

The  advantages  of  the  search  just  derived  over  the  sprt  are  apparent 
in  three  important  respects.  First,  the  values  of  7  and  6  ,  and  thus 
the  decision  thresholds,  depend  by  definition  upon  the  various  cost 
factors  and  a  priori  probabilities  involved  in  the  search.  The  previous 
method  of  assigning  arbitrary  a  and  |3  errors  to  determine  A  and 
B  can  be  now  examined  for  consistency,  if  not  completely  replaced. 
Second,  if  it  is  at  all  necessary  to  truncate  the  search,  the  decision 
points  described  by  7^  and  6^  are  the  result  of  a  straightforward 
and  well  defined  optimization  process,  whereas  the  truncation  of  Wald- 
type  tests  in  the  literature  are  rather  arbitrary. 

The  third  advantage  of  the  approach  just  developed  is 
the  important  fact  that  not  only  is  the  strategy  calculated,  but  C  (P), 
the  minimum  cost  obtained  by  using  that  strategy,  is  a  natural  by¬ 
product.  Thus,  if  we  wish  to  compare  two  different  systems,  it  is 
only  necessary  to  compare  their  respective  C(P)'s,  and  choose 
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the  one  which  has  the  higher  C  (P)  at  the  P  of  interest.  This 
point  will  be  the  subject  of  some  later  sections,  in  which  certain 
near -optimal  strategies  are  considered  and  compared. 

3.  5  Examples  and  Sample  Calculations 

With  the  assurance  of  section  (3.3)  that  the  process  will 

converge,  it  is  extremely  easy  to  successively  iterate  equation  (3.  3) 

to  any  desired  degree  of  accuracy.  In  particular,  by  referring  to 

the  proof  of  theorem  B,  we  see  that  if  C  (P)  is  at  most  e  away 

n 

from  C  .  (P)  ,  then  G  .  (P)  will  be  within  e  of  G  (P)  ,  forcing 
n  - 1  n+ 1  n 

a  like  bound  in  the  x  '*•  imum  difference  between  C  ,  ,  (P)  and  C  (F)  . 

n+ 1  n 

Thus,  as  in  the  work  that  follows,  we  may  select  some  e  and  stop 
the  iteration  when  max  (C^(P)  -  (P)}  <  C  . 

Another  helpful  computational  technique  is  available. 

This  involves  approaching  the  limit  C  (P)  from  below  rather  than 
from  above  by  starting  out  with  (P)  *  0  rather  than  CQ  (P)  s  t  (P) 
as  before.  That  this  process  also  converges,  and  to  the  same  C  (P)  , 
is  easily  shown  by  simple  modifications  to  the  proofs  of  theorems  A 
and  B.  This  "convergence  from  below"  suffers  in  not  having  any 
intuitive  baeie  for  the  index  of  the  convergence,  but  has  the  advantage 
of  converging  more  rapidly  than  the  standard  way  for  very  low-minimuir 
cost  systems.  It  also  serves  to  check  the  accuracy  of  whatever  numer'c 
approximations  might  be  made  in  the  calculations,  by  reaching  C(P) 
independently. 

3.  5. 1  Known  Signal  in  Gaussian  Noise 

For  general  illustrative  purposes,  we  shall  consider  the 
simple  example  of  the  search  for  a  target  having  known  characteristics 
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in  the  presence  of  additive  noise  having  known  (statistical)  properties. 
In  particular,  let  the  known  target  signal  be  s  (t)  ,  O^t^T  ,  with 


the  signal  energy.  We  assume  the  noise  is  Gaussian  in  magnitude 

N  N 

with  auto -covariance  —  6  (t)  ,  where  —  is  the  known  noise  energy 
density.  It  can  then  be  shown  (Helstrom  (19),  for  example)  that  if 
a  statistic  x  is  taken  to  be  proportional  to  the  cross -correlation  of 
s  (t)  with  the  received  signal  v(t)',  i.e. 


then  x  is  sufficient  and  has  the  p.d.f.  s.  under  and  S1  of 


p0  (*)  =  fN(x;  o,  i) 

P1  M  =  fN*X;  **’  ^  l 


(3.11) 


and 


where 

f 


i  The  literature  contains  many  analyses  based  upon  this 
model  of  signal  and  noise  characteristics.  It  has  the  advantage  of 
being  a  good  approximation  to  many  realistic  situations.  In  addition, 


-51- 


thfj  “strength"  of  &  signal,  relative  to  noise  may  be  represented 
the  single  variable  p  ,  the  familiar  signal -to -noise  ratio.  As  w 
have  seen  in  the  discussion  of  the  ROC  in  Chapter  II,  an  increase 
p  {all  other  system  variables  remaining  constant)  is  always  dev. 
but  p;  is  often  fixed  for  any  given  detection  device  and  target. 

The  convergence  of  C^{P)  to  G(P)  from  above  an<’  ' 

is  illustrated  in  Figure  3.2,  with  the  parameter  values  C^j  =  C, 

Wq  *  Wj  *  1  and  p  «  1  .  Table  3. 1  shows  the  convergence  of 

and  6  for  the  same  parameter  values.  Figure  3.3  shows  c  {? 
n 

for  various  other  values  of  Wq  ,  and  p  . 

The  calculations  were  performed  on  an  IBM  7090  at  th» 
M.  I.  T.  Computation  Center,  and  consisted  of  simple  iterations  i 
equation  (3.  3).  For  the  examples  in  Figure  3.  2  and  3.  3,  the  P- 
was  represented  by  points  at  intervals  of  .01,  and  C  .  [IP)  was 
linearly  interpolated  between  them,  The  integration  was  done  nu, 
rically  by  simple  trapesodial  addition  to  an  error  of  roughly  .  1%. 
The  value  of  <  used  was  .001,  and  as  can  be  seen  from  the  figur 
the  convergence  is  quite  rapid.  A  flow  chart  of  the  computer  pro 
involved  appears  in  appendix  C. 

3.5.2  Numerical  Example 

The  shepherd  of  the  previous  examples  now  realizes  th< 
his  experimentation  (observation  of  readings  on  the  baa -meter)  dc 
cost  something.  In  fact,  because  of  the  rural  location,  electricity 
is  very  expensive,  and  he  estimates  that  the  meter  operating  cost 
is  $  10  for  every  ten-minute  integration  period  (a  fixed  time  unit 

I 
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V 


I - I 


Iteration 

Number 

C0(P>  = 

T(P> 

c0iP)  =  o 

n 

5 

n 

6 

n 

0 

.500 

.500 

" JJ  "  — 

-- 

1 

.  389 

.606 

.100 

.900 

2 

.  373 

.  621 

.  183 

.814 

3 

.  369 

.627 

.250 

.744 

4 

.366 

.629 

.304 

.689 

5 

.  365 

.625 

.  339 

.  655 

6 

.  354 

.  639 

7 

.361 

.  632 

8 

.  364 

.  631 

9 

— 

.  365 

.  630 

TABLE  3.  1 

Convergence  o£  yr  and  6^  ,  p  =  1,  WQ  «  W1  =  1, 

Si'Cjo--1- 


required  by  the  meter  for  each  observation).  Using  the  reduced  cost 
matrix  of  section  (2.  4.  2)  (in  amts  of  $  100) 


0  1 


he  notes  that  CQ1  =  C1Q  -  1  ,  and  the  meter  operating  cost  assigns 
W0  =  ^1  =  *  *  *  figure  3.  2  and  Table  3.  1  give  the  strategy: 

1*  K  P  >  ‘  63  '  don,t  take  any  measurements  at  all,  and  go 
directly  to  the  pasture 

2.  If  F  <  .  37  ,  don't  take  any  measurements  at  all,  and  go 
directly  to  town 


3. 


If’37— P2S.63,  take  v  measurement  x^  . 
compare  L^jt^)  -  exp(x, -.5)  with  the  value 


1-P 


(.59)  ,  and 


1-P  6 
P  1-6 


1-P 

-jT<1.85) 


Then 


1-P  7 
P  1  -7 


i.  If  exp  [xj-.  5]  is  outside  the  range  (-—(.59)  ,  (1.85)  )  , 

take  the  appropriate  terminal  decision.  If  exp[xj  -  .  5}  is 
within  that  range,  take  another  observation  x^  ,  and 
compare  exp[(Xj 5)+(x2>.  5)]  with  that  range,  and 
•o  on. 


Thus,  if  P  a  -  ,  the  optimal  decision  rule  requires  at  least  one 
observation,  and  offers  an  expected  loss  (in  dollars)  due  to  wrong 
decision  of  39  (from  Figure  3.  2),  or  a  total  loss  of  -50+39=-ll 
By  using  the  STSD  decision  rule  as  in  section  (2.4.  2),  the  cost  was 
-19  ,  to  which  we  must  add  10  ,  the  cost  of  the  required  experiment 
for  a  total  of  -9  .  The  optimal  sequential  rule  thus  saved  $  2  . 


The  saving  becomes  even  more  substantial  as  P  approaches 
0  or  1,  and  is  $  10  (the  cost  of  the  needless  experiment  required  by  STSD) 
for  the  certainty  prior  information  conditions  P  =  0  or  1 .  This  is  shown 
in  Figure  3.  4. 

If  the  cost  of  experimentation  were  much  smaller  (say  $  1 
per  observation,  so  that  Wq  =  Wj  =  .01),  then  the  advantage  of  using 
the  optimal  sequential  search  becomes  even  more  pronounced,  as  is 
also  seen  in  Figure  3.4 

3.  6  Fixed  Threshold  Sequential  Search 

Now  we  shall  consider  some  practical  limitations  to  the 
applicability  of  the  optimal  search  just  derived.  We  have  assumed 
that  the  result  of  any  observation  is  the  measurement  of  some  continuous 
random  variable,  the  value  observed  being  called  x  .  In  order  to 
implement  the  search,  it  is  necessary  to  be  able  to  calculate  L  (x) 
in  order  to  obtain  the  posteriori  probability  needed  for  comparison 
to  y  and  6  . 

Many  physical  detection  systems,  however,  are  limited 
in  their  capability  to  measure  x  as  a  continuous  variable.  Others 
have  very  limited  mathematical  capability  (whether  space -cost  limited 
computers,  or  time-ability  limited  humans)  and  the  storage  and  cal¬ 
culation  of  continuous  likelihood  functions  are  beyond  them. 

A  particular  form  of  limitation  typical  of  many  systems  in 
use  today  will  be  discussed  in  this  section,  with  the  object  of  provi¬ 
ding  a  means  of  comparing  the  best  results  available  with  these  limited 
systems  to  the  results  obtained  in  the  previous  sections. 
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f  The  limitation  involved  will  be  the  required  use  of  a  fixed 

f  ti  ■  m" 

i 

t  threshold  for  every  observation,  with  the  output  of  the  observation 
being  simply  "over  threshold"  or  "under  threshold  .  An  example 
that  illustrates  this  is  the  use  of  an  active  sonar  that  every  cycle 
displays  either  a  "blip"  or  not.  Inside  the  sonar  unit  there  is  a  dis¬ 
crimination  device  that  can  only  tell  whether  or  not  the  received 
signal  statistic  is  over  or  under  some  pre-set  threshold,  and  a  blip 
is  displayed  or  not  as  a  consequence.  The  observer  then  must  make 
decisions  on  the  basis  of  observing  a  series  of  binary  variables  rather 
than  a  series  of  continuous  ones. 


It  is  convenient  here  to  let  "indication"  mean  the  exceeding 
of  the  fixed  threshold  by  a  signal.  This  word  must  be  used  with  some 
care,  in  that  an  indication  alone  does  not  imply  any  decision  concerning 
the  presence  or  absence  of  the  target.  An  indication  is  just  a  way  of 
reporting  the  output  of  a  particular  (threshold)  detection  device. 
Decisions  are  to  be  made  on  the  basis  of  (perhaps)  many  indications. 


The  problem  facing  us  now  is  essentially  identical  to 
that  stated  in  section  (3.  1),  witn  the  substitution  of  a  binary  random 
variable  y  for  the  continuous  x  ,  with  probability  mass  functions 
hQ  (y)  and  h^  (y)  replacing  to  (x)  and  Pj  (x)  .  If  we  let  y  take 
the  values  0  or  1,  we  have 


h0  M  = 


1 -f  y=0 


h!  (y)  = 


1 -d  y=0 
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where  f  and  d  are  the  "false  indication"  and  "detection"  probabilitie 
defined  by 


f  =  f(x*)  =  \  Pn(x)dx 


(3.12) 


00 

d(x*)=  ^Pj 


(x)  dx 


where  x*  is  the  threshold  that  is  assumed  to  be  fixed  throughout  the 

length  of  the  search.  (Note  that  these  equations  are  identical  to 

equations  (2. 7)  for  p,  and  p.  .  We  use  "f"  and  "d"  for  notational 

i  d 

ease,  and  to  remind  us  that  they  now  refer  to  "indications"  in  the 
analysis  of  this  more  general  decision  structure. ) 

For  a  given  fixed  setting  of  x*  ,  let  us  define  (P)  to 
be  the  minimum  expected  cost  obtained  by  using  the  optimal  truncated 
fixed  threshold  strategy.  Then  by  using  arguments  identical  to  those 
leading  up  to  equation  (3. 10)  we  may  write 


T  (P)  PC 


(l-P)C, 


6  SPSl 
n 

OSPS7 


(3.13) 


PW1+(l-P)W0+tPd+(l.P)f]  Pn.,  [p ]  ♦ 

+(P(1  -d)+d  -P)(1  .£)]  Fn  t  j 


where  we  have  made  use  of  the  fact  that 
prob.  {y=l )  =  Pd  +  (l-P)f 
prob.  {y=0>  =  P(l-d)+(l-P)(i-f) 
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p  ^ 

prob.  {SL  [ y=l  >  =  Pd  +  (i_p)f 

f  |  P  (1  -d) 

prob.  {S^y^}  =p(1.d)  +  (i.p )j!_£)  ^ 

It  is  important  to  repeat  the  fact  that  although  there  is  a  fixed 
detection  threshold  x*  that  leads  to  indications,  there  still  remain  the 
important  points  7n  and  6^  which  define  the  decision  regions.  These 
points  are  referred  to  as  decision  thresholds. 

In  the  limit  as  n-*"*5  ,  if  we  can  show  that  F^  (P)  converges, 
we  may  call  F  (P)  =  F(P). 

That  F^(P)  converges,  along  with  6^  and  7  .  may  be 
shown  exactly  as  in  theorems  A,  B,  C  and  D  of  section  3.  3.  1  with  the 
minor  changes  in  notation  necessary  to  allow  for  the  discrete  character 
of  y  . 

The  F(P)  and  associated  strategy  that  is  calculated  are  still 

conditioned  upon  the  threshold  x*  ,  in  that  both  d  and  f  are  functions 

of  x*  .  For  a  final  overall  cost  minimization,  we  may  select,  for 

every  P,  that  x*  which  minimizes  F(P)  .  Calling  this  overall 

optimum  F  .  (P)  we  see  that 
min 

Fmm(P)  =  •  (3.U) 

Quantization  of  the  continuous  target  information  into  a  binary 
variable  is  sure  to  produce  a  loss  in  the  information  sense,  but 
in  the  cost  context  of  the  present  problems  we  can  readily  compute 
the  loss  in  more  meaningful  cost  terms. 
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Successive  iterations  of  equation  (3. 13)  are  again  straight¬ 
forward.  and  computer  calculations  are  greatly  speeded  by  the 
absence  of  an  integral.  Figure  3.  5  shows  a  typical  convergence  to 
F(P)  for  a  particular  setting  of  x*  .  Figure  3.6  shows  the  application 
of  equation  (3. 14).  To  avoid  confusion  in  drawing,  only  some  selected 
curves  for  F(P)  are  shown,  but  in  general  F  .  (P)  is  the  lower 
envelope  of  all  F(P)  curves.  It  is  also  convenient  (and  illustrative) 
to  label  the  F(F'  curves  by  the  d  corresponding  to  their  througl 
equation  (3. 12).  The  values  of  the  parameters  have  been  selected  to 
provide  comparison  with  the  C  (P)  of  Figure  3.  2,  which  is  shown  in 
dotted  lines.  The  optimal  threshold  setting  (and  equivalent  d)  that 
results  from  this  example  is  plottrd  as  a  function  of  P  in  Figure  3.  7. 
Note  that  for  P  outside  some  range  there  is  no  optimal  threshold 
setting,  as  the  decision  rule  in  those  cases  calls  for  a  terminal 
decision  with  no  observation  taken. 


The  implementation  of  the  search  is  identical  to  that 
described  by  section  3.  4,  where  l#(x)  ii  replaced  by  L(y) 


(  h± 

i  -i 


My>-< 


y  ■  0 


y  -  1 


3. 6. 1  Numerical  Example 

On  occasion,  the  shepherd  of  our  previous  examples  has  to 
use  a  less  versatile  meter,  one  that  contains  a  built-in  threshold  (that 
may  be  set  at  the  start  of  the  search).  The  signal  and  noise  statistics 
are  still  characterised  by  the  p,  d.  f.  s. 
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p0  (*)  -  *N(x;  o,  i) 


P1  (x)  *  fN(x;  1. 1) 

so  that,  upon  setting  a  threshold  x*  ,  the  false  indication  and  detection 
probabilities  will  be 

f  =  1  -  erf  (x*) 
d  =  1  -  erf (x*-l)  . 


With  the  cost  values  =  1  ,  Wq  *  W.^  =  .  1  of  the  earlier 

examples,  the  shepherd  uses  Figures  3.6  and  3.7  to  obtain  the  following- 
decision  rule: 


1.  If  P  >  .61  ,  donlt  take  any  measurements  at  all,  and  go  directH 
to  the  pasture. 

2.  If  P  <  .  39  ,  don't  take  any  measurements  at  all,  and  go  directs 
to  town. 


3.  If  .  39  2S  P  2S  .  61  ,  set  the  threshold  as  indicated  in  Figure  3.  7, 

then  take  a  measurement  of  y  . 


4.  If  y  =  0  (x  <  x*)  then  compare 


1 -d  y  (1-P) 

y—r  with  the  values  -r. - r  ~=r — 

.  n  Pt  l-f  P 

. 64(1  -P)/P  ,  and  *  1 . 56 (1  -P)/P  .  If  y  *  1  (x» x*} 

then  compare  d fi  with  these  values.  If  the  likelihoods  fall 

outside  these  limits;  make  the  appropriate  terminal  decision. 

If  within  them,  take  another  measurement,  and  so  on. 


If,  as  before,  we  see  that  at  least  one  measurement  must  be  taken. 

From  Figure  3.  7  we  find  that  the  threshold  is  set  such  that  d  =  .68  , 
f  *  .  3  .  If  y  s  1  is  observed,  since  L  (1)  a  .  68/.  3*2.  27  is  greater 
than  1.56,  a  decision  is  required.  If  y  *  0  is  observed,  since 
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L  (0)  =  , 32/» 7  -  .  46  is  less  than  .  64  ,  a  decision  is  required.  This 
strategy  (for  P  =  1/2)  is  idential  to  the  STSD  strategy,  and  so  we  would 
expect  the  search  cost  at  this  P  to  be  the  same  as  the  STSD  cost,  which 
it  is.  However,  at  other  values  of  P  ,  the  advantage  of  the  sequential 
threshold  strategy  becomes  apparent. 

Illustrations  of  the  difference  in  costs  using  the  three  strategies 
mentioned  so  far  in  this  chapter  are  shown  in  Figure  3.  8.  The  curves 
show  the  percent  increase  in  cost  (over  the  minimum  cost  attainable  by 
the  optimum  sequential  search)  due  to  using  either  the  non -sequential 
STSD  strategy,  or  the  sequential  fixed  threshold  strategy.  As  shown  by 
the  figures,  the  importance  of  using  a  sequential  search  is  considerable 
when  the  experimental  cost  decreases. 

3.  7  Adaptive  Threshold  Sequential  Search 

The  previous  section  considered  the  threshold  x*  to  be  fixed 
throughout  the  search.  This  limitation  is  often  attributable  to  a  lack  of 
the  time  needed  to  adjust  the  threshold  (if  necessary)  inbetween  possible 
observation  periods.  In  some  systems,  however,  although  the  detection 
device  is  by  nature  a  threshold  indicator,  it  is  possible  to  vary  the 
threshold  inbetween  observations.  A  practical  example  is  the  use  of 
a  radar  FPI  scope  by  an  experienced  operator.  The  operator  usually 
keeps  the  gain  of  the  scope  low  (to  ayoid  "snow")  until  a  possible  blip 
shows  up  at  some  point.  The  operator  can  then  increase  the  gain  on  the 
next  sweep  because  he  will  be  concentrating  on  a  smaller  region  of  the 
scope,  (i.e.  the  neighborhood  of  the  possible  blip)  and  so  is  not  as 
effected  by  the  increased  noiee. 
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w0  *  W1  *  * 1 


- WQ  «  W  =  .01 


STSD  strategy 


sequential  fixed  threwtold  strategy 


\. 


r#Tti 


This  process  of  controlling  the  threshold  is  often  referred  to 
as  the  'alerting  effect *'  and  shows  up  in  our  model  of  sequential  search 
in  the  following  way.  Let  the  decision  structure  be  such  that  the  searcher 
is  required  to  take  two  observations,  and  that  be  must  make  a  terminal 
decision  after  the  second  observation,  both  limited  to  threshold  obser¬ 
vations.  If  he  determines  the  first  threshold  x*  on  the  basis  of  STSD, 
he  has  (with  =0) 

T  ,  i-P  (  cio 

Si 


The  resultant  first  observation  can  either  be  y  =  1  or  y  =  0  (indication, 
or  no  indication).  Now  suppose  y  =  1  is  observed.  Then  the  posterior 
’’target  present ”  probability  P'  is 


pp. 


P'  =  prob.  (Sj  |y=l } 


He  would  now  want  to  set  the  second  threshold  so  that 


Since  in  practical  systems  ,  we  see  that  L  (x*)  <  L  (x*)  .  With 

the  usual  monatonic  likelihood  function,  this  implies  that  x*  <  x*  ,  so 
th*  second  threshold  has  been  lowered.  Because  this  lowering  of  the 
threshold  between  first  and  second  observations  yields  a  corresponding 
tendancy  towards  deciding  the  target  is  present,  the  expression  “alerting  “ 
is  descriptive.  The  1  alert n  is  due  to  the  reception  of  y  =  1  (x^  >  x^)  . 

If  y  =  0  (xj  <  x*)  were  received,  there  would  be  an  equivalent  “dulling" 
effect  by  an  increase  in  the  second  threshold. 
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Thii  sort  of  adaptive  threshold  adjustment  may  be  carried 

over  into  the  framework  of  the  previous  problems,  and  calculations  can 

be  made  to  compare  the  resulting  mirimum  cost  with  the  solutions  of 

the  fixed  threshold  and  continuous  variable  searches  of  the  previous 

sections.  In  order  to  show  this,  let  us  define  K  (F)  to  be  the  minimurr 

n 

cost  attained  .by  using  an  optimal  truncated  adaptive  threshold  strategy. 
Then  we  may  write 


T(P) 


K^(P)  =  min/ 


min  Ln(P,x*) 


where 


Ln<P'**>  =  FWj  ♦  (1-P>r0  +  [Pd  ♦  (l-P)fl  Kn  l  [^—^7  ) 

+  [P(i-d)  +  (.-P)(l-f)]Ka  l 


The  dependence  of  L>n(P,x*)  on  x*  is  a  consequence  of  d  =  d(x*), 
f  a  f(x*)  as  defined  in  equations  (3. 12). 


Again,  calculation  of  K  (P)  is  straightforward,  but  rather 

n 

lengthy  due  to  having  to  explore  the  x*  variation  at  each  stage  of  the 
iteration,  rather  than  juat  at  the  limit  as  jn  the  previous  section.  The 
approach  of  K  (F)  to  the  limit  function  lt(P)  as  is  assured  by 

n 

simple  modifications  to  the  proofs  given  in  section  3.  3.  1.  The  resulting 
minimum  cost  function  and  strategy  is  aghin  similar  to  those  derived 
before. 


Although  calculations  of  K(F)  have  not  been  carried  out,  it 
is  possible  to  prove  the  intuitive  inequality 


C(P)S.K(P)SF  .  (P)  for  all  0  =£  P  <  1 
mm 

the  proof  is  not  instructive,  and  is  omitted  here. 
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CHAPTER  IV 


TARGET  ARRIVAL  AT  A  RANDOM  TIME 

4.  1  Towards  a  More  Realistic  Model  of  Certain  Searches 

All  the  previous  work  in  this  paper  has  been  carried  out  with  the 
condition  that  the  states  of  nature  describing  the  system  were  fixed  throup 
out  the  search.  This  was  a  basic  part  of  the  model  first  described  in 
Chapter  II,  and  it  is  only  on  this  basis  that  any  of  the  hypothesis  tests, 
sequential  or  otherwise,  could  be  applied.  In  the  analysis  of  sequential 
search  strategies,  this  condition  of  stationarity  of  the  target  can  be  seen 
to  be  a  very  limiting  one,  in  that  the  duration  of  the  search  is  a  random 
variable.  Thus  '’throughout  the  search"  means  all  that  time  for  which 
the  probability  of  the  search  duration  is  non-zero.  This  time  range  is 
often  infinite.  The  use  of  a  hypothesis  test  must  therefore  be  carefully 
considered,  and  should  be  used  only  when  the  only  alternatives  to  the 
states  of  nature  are  either  Sq  and  ,  and  when  only  one  of  these 
states  will  definitely  hold  for  the  entire  search. 

Many  practical  problems  which  have  been  analysed  from 
the  point  of  view  of  hypothesis  testing  are  much  more  reasonably 
approached  by  a  new  model  which  will  be  treated  in  this  chapter. 

This  model  allows  the  target  to  arrive  in  the  cell  at  some  time  t 
(a  random  variable),  after  the  start  of  the  search  and  then  remain 
there  for  the  rest  of  the  search.  The  decision  rules  should  then  be 
based  upon  the  probability  that  the  target  has  arrived  yet,  rather  than 
whether  or  not  it  is  present  at  all. 


This  sort  of  model  is  certainly  applicable  in  many  military 
detection  models,  and  the  term  "raid  recognition"  has  often  been  used 
to  describe  the  general  problem  statement.  When  searching  for  a 
submarine  in  a  particular  area,  for  example,  we  ■'•arely  have  the  luxury 
to  assume  that  it  will  either  be  there  or  not  for  the  rest  of  the  search 
(i.w.  time  of  interest).  In  particular,  when  the  decision  =  "Target 
not  present"  is  made,  the  search  equipment  would  not  be  turned  off. 

In  fact,  in  problems  of  this  type  the  decisions  become  limited  to  two 
decisions:  or  W  .  Let  us  continue  the  submarine  search  example 

to  develop  typical  actions  and  cost  factors. 

Suppose  decision  (t)  is  the  order  to  send  AS W  aircraft 
at  time  r  to  the  region  representing  the  cell  covered  by  the  detection 
device.  If  the  submarine  has  not  yet  arrived,  then  the  aircraft  must 
return  to  base,  some  resultant  cost  of  false  decision  has  been  incurred 
and  the  search  continues.  If  the  submarine  has  arrived  at  some  time  » 
(before  the  Dj  decision  was  made)  then  the  longer  the  delay  between 
t  and  T  the  more  difficult  an  eventual  interception,  and  thus  there  is 
an  increase  in  what  we  might  call  the  interception  cost  when  the  (t) 
decision  ie  made.  The  search  is  then  considered  to  be  terminated. 

One  obvious  objective  is  to  make  decisions  so  that  the  expected  cost 
of  the  search  ik  minimised.  This  chapter  will  consider  such  a  model, 
and  the  optimal  search  that  evolve  s  from  it. 

We  shall  also  spend  some  effort  in  the  development  of  a 
simple  way  of  comparing  some  specific  non-optimal  rules  that  are 
being  practiced  or  proposed,  by  use  of  a  concept  similar  to  the  ROC 
described  in  Chapter  II. 
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4.2 


General  Problem  Statement 


1.  The  target  arrives  in  the  cell  of  interest  at  time  t 
(t=0,  1,  .  .  . )  with  known  probability  p  (t). 

2.  If  the  cell  is  observed  at  time  t  ,  the  result  is  a  random 
variable  x  which  has  p.d.f.  (x)  if  t  >  r  and  (x) 
if  t  —  T  . 

3.  At  every  time  t  (t=0,  1,  .  .  .)  the  searcher  makes  one  of  the 
following  decisions 

D(t)  :  Decide  the  target  has  arrived 
W(r)  :  Wait  for  another  observation  . 

4.  The  decision  D{r)  may  or  may  not  be  a  terminal  decision: 
If  D(t)  is  picked  and  t  >  r  ,  then  a  false  decision  cost 

F  is  incurred,  the  knowledge  that  t  >  r  is  gained,  and 
the  search  continues. 

If  D(t)  is  picked  and  t  r  ,  then  the  search  is  terminated 
with  a  cost  of  $(t,  t)  . 

5.  The  objective  is  to  minimize  the  expect  cost  of  the  search. 
The  strategy  achieving  this  minimum  cost  is  called  the 
"optimal "  strategy. 

Although  we  shall  not  solve  the  general  problem  as  stated  above,  it  is 
helpful  to  keep  it  in  mind  when  solving  it  subject  to  reasonable  assump¬ 
tions.  Again,  as  in  the  previous  work,  we  shall  restrict  pQ  (x)  and 
Pj  (x)  such  that  L  (x)  =  Pj  (x)/pQ  (x)  is  monotonic  non-decreasing  in 
x  ,  to  ease  the  notation. 

4.  3  Linear  Terminal  Coat,  and  Geometrical  Arrival  Time 

Two  assumptions  will  now  be  made  in  order  to  obtain  a 
solution.  These  also  have  the  advantage  of  reasonably  representing 
some  real  search  situations. 


-73- 


First.  we  shall  assume  that  the  terminal  cost  is  proportion* 
to  the  time  'We  ",  i.  e.  $(t,  t)  =  (t  -t) W  .  This  form  of  the  function 
is  not  necessary  for  a  solution,  but  offers  a  minimum  of  algebraic 
difficulties  that  might  otherwise  cloud  the  development. 

Second,  and  more  restrictive,  we  shall  assume  that  the  tar/n 
arrival  time  distribution  is 

p{t)=A(l-X)t  t  =0,1,...  (4.  V 

This  geometric  distribution  has  the  advantage  in  describing  the  arrival 
as  being  conceptually  "random"  by  the  fact  that  it  provides  a  constant 
probability  of  arrival  per  unit  time  (A)  ,  given  that  it  hasn't  yet  arrive." 
This  property  also  provides  a  simple  representation  for  a  state  variabi 
by  summing  up  the  total  information  about  the  state  of  nature  (i.e.  wh:>. 
or  not  the  target  has  arrived).  This  is  outlined  as  follows. 

If  we  let  P(t)  be  the  probability  that  the  target  has  arrived 
at  or  before  time  r  ,  then  by  equation  (4.1) 

T 

F(t)  *£p(t)  «  1  -  <1-A)T+1  (4.2) 

t*0 

Furthermore,  we  can  chow  that 

P(t41)  *  (1-A) F(t)  4  A 

so  that  (without  any  other  information)  to  describe  F(r  +  1)  ,  ait  that 
is  needed  is  P(t)  . 

hi  addition,  suppose  that  a  value  of  x(t)  is  observed  at 
time  r  .  Then 

1  prob.  x (t ) 
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To  calculate  prob.  (x(t),  T  +  1)  ,  we  note  that  this  could  happen  in  two 
ways:  the  target  could  have  arrived  at  time  t  or  before,  in  which 
case  Pj  (x)  is  the  p.  d.£.  of  x(t)  ,  or  the  target  could  have  arrived 
at  time  t+1  ,  so  that  p^  (x)  is  the  p.  d.  f.  Thus 


P  (t  +  1  !x(t))  s 


[P(r)P1  (x)  +  [1  -P  (t  )]  A  pQ  (x)] 
P(r)p1  (x)  +  (1-P{r))p0  (x) 


(4.3) 


and  we  see  that  given  some  observation  x  ,  the  posterior  probability 
of  the  target  arriving  at  or  before  time  T  +  1  is  still  dependent  only 
upon  P(t)  ,  and  not  T  explicitly.  It  is  this  basic  Markov  property 
that  allows  us  to  proceed  now  in  a  way  similar  to  the  approach  in 
Chapter  III  (where,  we  recall,  the  Markov  property  of  the  successive 
likelihood  ratios  led  to  the  dynamic  program  approach). 


The  assumption  leading  to  equation  (4.  1)  also  enables  us 
to  characterize  the  start  of  the  search,  since  P (0)  =  A  .  What  is 
more  important,  we  note  that  according  to  statement  (4)  of  the  problem 
definition,  when  the  false  decision  {D(T)Jt>T}  is  made,  knowledge 
that  t  >  r  is  automatically  gained.  This  fact  and  equation  (4. 1)  lead 
to  P (t+1  |t>T)  =  A  . 


We  are  now  prepared  to  write  a  functional  equation  for  the 
minimum  search  cost.  Let  us  define  V(P(t)  )  as  the  minimum  search 
cost  obtained  using  the  optimal  search  strategy  at  time  r  where  P(r) 
is  the  present  value  of  the  probability  that  the  target  has  arrived  previous 
to  or  at  time  t  .  There  are  two  decision  choices.  One  is  D(r)  : 
decide  target  has  arrived,  with  resultant  probability  being  wrong  of 
1-F(t)  „  and  subsequent  cost  of  F  plus  what  the  continued  search 
will  cost  from  then  on.  The  other  is  W(t)  :  wait  for  more  information  x  , 
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in  which  case  the  "late"  cost  is  incurred  only  if  the  target  has  arrived 
(probability  P(t)  ),  and  the  search  continues  with  the  proper  posteriori 

.  i 

probability  given  by  equation  (4.  3).  The  minimum  cost  is  then  given  by 


V(P)  =  min 


(1  -P)(F+ V(A) )  :  D 


00 

PW+J[PPl  (x)+(l-P)p0  (x)]V 


-00 


Ppx  (x)+(1-P)p0  (x)A 

Pl(x)+(1-P)p0  (x)  :  ' 

(4.4) 


The  r  has  new  been  purposely  left  out  as  an  argument  of  P  and 
the  decisions,  since  this  equation  holds  for  all  t  and  only  P(t)  =  i- 
is  needed  to  express  the  right-hand  side.  In  what  follows  r  will  be 
left  out  except  when  necessary  to  avoid  ambiguity. 


have 


One  result  is  immediately  apparent.  By  letting  P-0  we 


V(0)  s  min  [F  +  V(A),  V(A)1  =  V(A)  for  F>0. 

This  tells  us  that  the  cost  of  search,  if  we  know  the  target  has  not 
arrived,  is  the  same  as  if  we  waited  one  time  unit  and  started  again. 
This  is  because  there  can  be  no  "late'3  cost  W  if  the  target  has  not 
yet  arrived. 

In  order  to  develop  a  feeling  for  the  solution  to  equation 
(4.4),  and  to  obtain  an  upper  bound  upon  V(P)>  the  next  section 
shall  consider  the  degenerate  case  getting  no  information  from  the 
observations. 
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4.  4  Optimal  Search  With  No  Information 

Suppose  that  pQ  (x)  s  p^  (x)  .  Then,  as  can  be  shown  by  equatior 
(4.  3),  an  observation  of  x  does  not  affect  the  posterior  evaluation  of  P  . 
In  this  case  the  observation  x  is  irrelevent,  and  the  searcher  gains  no 
information.  The  searcher  may  still  develop  an  optimal  strategy,  which 
now  consists  simply  of  either  waiting  one  time  unit,  or  deciding  the  targer- 
has  arrived.  Equation  (4.  4)  becomes 


As  noted  before,  V (0)  =  V(A)  ,  and  we  can  also  easily  see  that 

V(l)  =  0  • 

From  the  form  of  equation  (4.  5)  it  is  postulated  that  the 
structure  of  the  strategy  will  be 

if  P  >:  y  ;  X> 

P  ^  y  :  W 

where  y  is  a  decision  point  to  be  determined  as  part  of  the  solution. 

That  this  is  indeed  the  form  of  the  strategy,  and  that  it  is 
not  degenerate  (that  is,  0  <  y  <  1) ,  may  be  shown  by  the  following 
proof  by  contradiction.  (The  discussion  that  follows  car.  also  be 
shown  to  be  valid  for  the  more  general  equation  (4.  4).  Since  it  is  the 
form  of  the  proof  that  is  of  interest,  it  is  carried  out  in  this  less 
complicated  case.) 
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Let  us  define 

D(P)  =  (1-P)<F4  V(A)) 

G(P)  -  WP  4  V[F  4  (1  -P)  A  j  . 

A  sketch  of  these  functions  is  shown  in  Figure  4.1-  D(P)  is  a  straight 
line  with  D(0)  =  F  4  V(A)  =  F  4  V(0),  D(l)  *0  .  G(P)  has  an  unknown 
functional  dependence  on  P  through  V[P  4  (1-P)A|  *  but  it  is  continuous 
by  the  continuity  of  V(P)  -  The  boundary  values  are  known  and  are 
G(0)  =  V(A)  =  V{0),  0(1)  =  W  4  V(l)  =  W  .  Since  F  >  0  and  Vf  >  0  , 
then  G(1)>D(1)  and  G(0)<D(0)  so  that  G(P)  and  D{P)  must 
intersect  at  at  least  one  point. 

Suppose  G(P)  were  such  that  G(P)  and  D  (P)  intersect 
at  more  than  one  point  (in  Figure  4.  1  this  is  illustrated  by  the  dotted 
line),  say  y '  ,  y"  and  ym  .  Let  us  select  a  point  P'  such  that 

y'  <  P'  <  y" 

yM<  P' 4  (l-P’)A  <  y"'  (4.6) 

where  (V  ,  y**J  ie  a  D  region  and  [y"  ,  ymJ  is  a  W  region. 

Then  by  aquation  (4. 5) 

V(P')  -  (1-P'HFI  V(A)  )  <  WP'  4  V[P'  4  (1-2>')AJ  (4.7) 

and 


V(P' 4  (l-F')AJ  =  W(P' 4  (l-P')X)  4  V[P'4(1-P')A  4  (1 -P5)(l -A) A]  < 
<(1-P»)(1-A)[F4  V(A)]  . 


(4.8) 
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The  construction  of  V (P) 


Combining  equations  (4.  7)  and  (4.  8) 


(l-P'HF  +  V(X))  <  WP'  +  (1-P>)(UA) [r  +  V(X)] 
which  reduces  to 

P' > 

W  •  (4. 9) 

If  we  select  another  point  P"  so  that 


yH  <  pM  <  ylH 

y"'  <  P%  (1  -p'^x  <  1 


we  can  show  in  a  similar  manner  that 


P"<*J£±vjXLi 

w 


(4.  IOj 


which,  with  equation  (4.9)  implies  that 
(4. 6)  and  (4. 10)  require  P*  <  y"  <  p"  f 


P  <  P'  .  But,  since  equations 
the  contradiction  is  proven. 


We  have  just  shown  thsn  that 
point,  and  defining  this  point  as  P 


G  (P)  and  D  (P)  intersect  at 
■  y  ,  we  re-write  equation  (4.5) 


V(P) 


0-p)(r  ♦  v(A) ) 


WP4  V[P4(1-P)AJ 


P  St  y 
P  (S  y 


(4.11) 


As  a  first  step  in  the  solution  of  this  squation,  let  P  *  y  . 
Thsn  since  y4(l-y)\  >y  we  have  V(y)  «  (1  -y)<I>V(A)  )  «  Wy  4 

+  V<H<1'r,A)  *  +  0-7)(l-X)(F4V(A)  )  from  which  we  get 


F4V(A)«t1~  ¥ 
1-y  A 


<4.12) 
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so  that  equation  (4. 11)  becomes 

C1-p>T^t  PET 

V(P)=<j  (4.13) 

(wp+ V[P  +  (1-P)XJ  PSy 

The  next  step  is  to  find  y  in  terms  of  W  ,  F  and  A  .  Once 
this  is  obtained,  the  optimal  strategy  is  defined.  (Determination  of  the 
functional  form  of  V  (P)  for  P  y  will  then  r  ^ly  upon  iterations  of 
(4.  13)  in  a  manner  to  be  described  later.) 

To  determine  y  ,  let  us  assume  that  y  has  been  obtained 
and  is  such  that 

X  <  y 

1  -  (J-X)2<y 
1  -  U-X)1*"1  <  y 

1-(1-X)n2ty  (4.14) 

where  n  is  the  smallest  integer  such  that  equation  (4.  14)  holds. 

By  n-1  successive  applications  of  equation  (4.  13)  we  get 
V (X)  *  WX  4  V[1  -  (1-A)2) 

»  WA  4  W(1  -  (1-A)*)  4  V(1  -  (1-A)3] 

*  Wf(n-l)  -  (1  -  (1-A)*’1)]  4  V(1  -  (1  -A)n]  , 

all  (n-1)  steps  being  the  result  of  W-decisions.  Finally,  since  the 
n**1  must  be  a  D-decision: 

V(.i)  =  W[(n-1)  -  ~  (1  -  (l-*)11'1)]  +  <1-A)n^  ?  . 
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Using  the  value  of  V (A)  from  Situation  (4. 12)  we  may  solve  for  y  in 
terms  of  n 


7*1 


X(~  +  n) 


(4.15) 


By  use  of  equation  (4. 14)  we  find  that  n  is  the  smallest  integer  such 
that 


0-X)nS - -  (4.16) 

HA(~  +  n) 

Once  n  is  found,  y  is  then  obtained  from  equation  (4. 15). 

We  have  just  proved  that  the  form  of  the  strategy  consists 
of  waiting  for  a  fixed  amount  of  time  (number  of  time  units)  n-,1  ;  .then 
choosing  a  D-decision.  In  the  event  that  the  search  has  just  started, 

(  or  that  a  D-decision  has  just  been  made  but  the  target  has  not  yet 
arrived,  so  that  the  search  must  be  resumed  with  P  *  0),  this  fixed 
amount  of  time  is  given  by  equation  (4.  16),  from  which  y  can  be 
determined. 

U  the  eearch  starts  out  so  that  P  #  A  ,  then  by  successive 
applications  of  aquation  (4. 13)  we  can  show  in  a  calculation  similar 
to  that  abova  that  the  tims  until  a  D-decision  ,  n(P)  ,  is  g.’vtsm  by 
ths  smallest  n(P)  that  satisfies 

(l-P)(l-X)n(P)“1  *  1  -  y  .  (4.17) 

In  order  to  compute  V(P)  we  again  simply  apply  equation 
(4.13),  [n(P)-l]  times  with  decision  W,  and  the  n(F)**1  time  witn 
a  D-decision.  This  gives  the  following  form  of  the  minimum  cost, 
where  n  =  n(P) 
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V  (P)  =•  W  [(n  - 1 )  -  ~(1  *  (1-A)U_1)J  -f  (l-PHl-A)11"1  yl-  ? 

(4.  IS) 

When  the  mean  arrival  time  gets,  very  large  (so  that  A  1), 
an  interesting  approximation  .holds .  Since  n  gets  large,  'we  m^y 
consider  equation  {4.  lb)  to  be  an  equality,  and  as  A-*-0  we  find* 


1  +  A  (^r  +  n)  =  (1  -A ).~n  =  1  +  nA  +  A?  4-  ,  . 

W  c. 


so  that 


n (n+1)  a 


2F 

WA 


I 

or,  since  n  is  very  large,  n  —  ~\j 
equation  (4.  15)  becomes 


2  F 
WA 


With  this  approximation 


md  so  V(0)  =  V (A)  = 


2  FW 


from  equation  (4.  12).  Thus,  if  the 


search  always  starts  with  P  =  0  or  P  =  A  ,  this  expression  gives 
the  minimum  expected  search  cost. 


4.  1.  1  Numerical  Example  of  the  Non- Informative  Search 

In  this  section  we  shall  treat  a  simple  example  of  the  search 
analysed  in  the  previous  section.  The  solution  is  in  itself  interesting 
and  is  also  useful  in  order  to  compare  the  result  with  an  informative 
search  to  be  tieated  later. 
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ft  is  known  that  an  unfriendly  trawler  will  arrive  in  a  certair 
region  of  the  ocean  in  order  to  cut  some  trans -oceanic  cables.  The 
defense  of  this  region  is  carried  out  by  a  unit  that  can  dispatch  high 
speed  aircraft  to  the  region  which  are  able  to  identify  and  deter  the 
trawler.  If,  when  the  aircraft  are  sent  out,  the  trawler  is  not  present, 
they  note  this  fact,  return  to  base  and  a  flight  cost  of  $  10,  000  is  incur 
However,  for  every  half-hour  (unit  time)  period  after  tt.  arrives-  that .th- 
trawler  is  unchallenged  in  the  region,  it  does  $  1,  000  worth  of  dama/;, 
It  is  assumed  that  the  time  that  the  trawler  will  arrive  at  the  region  i- 
a  geometrically  distributed  random  variable  with  a  mean  of  5  hours 
(10  unit  time  periods),  so  that  A  =  .  1.  In  units  of  $10,000,  F  =  1 


and  W  =  .  1 . 


Using  equation  (4. 16) 


(•  9)n  =S  (2  +  .  In)'1 


we  find  that  the  smallest  n  th?t  satisfies  this  expression  is  n  =  11  . 
Thus  the  optimal  strategy,  given  that  the  trawler  is  not  present  at  the 
start  of  the  search,  is  to  wait  (n-1)  time  units  (5  hours)  before  sending 
out  the  aircraft,  and  repeat  this  procedure,  until  the  trawler  is  found. 

A  calculation  of  y  is  made  with  equation  (4. 15) 


,.1-L ^--673 


and  the  expected  cost  at  the  beginning  of  the  search  is 

V(0)  .  V(A)  -r 3-  £  -  F  ":1  =  1-06  . 

l-y  A  . 327  • 


If  P  is  the  probability  that  the  trawler  is  in  the  region  at 
the  start  of  the  search,  then  n(P)  and  V  (P)  may  be  calculated  by  mean 
of  equations  (4.  17)  and  (4.  18)  with  the  value  of  7  =  .  673  .  A  plot  of  V(P; 
is  shown  in  Figure  4.  2.  Note  that  V  (P)  consists  of  the  lower  envelope 
of  11  straight  lines.  These  represent  the  cost  of  search  if  the  strategy 
is  to  wait  (n-1)  units  of  time  before  sending  the  aircraft,  where 
n  =  1,2,  ...»  11  .  The  n  for  the  minimum  cost  at  any  P  is  thus  n(P)  . 
For  example,  for  P  2:  7  ,  n  =  1  and  equation  (4.18)  gives 

v(P)  =  x  (1'p)  =  ^•06<1-p)  • 

4.  5  Previous  Work  Relating  to  the  Non- Informative  Search 

Before  considering  the  informative  search  model  it  should  be 
noted  that  the  non* informative  problem  has  been  considered  in  the  litera¬ 
ture,  but  not  within  the  framework  of  search.  Barlow  et.  al  (2)  discuss 
this  problem  as  an  example  of  a  "checking"  procedure,  and  consider  its 
applications  particularly  to  problems  of  checking  equipment  that  is  subject 
to  random  failure.  Thus  the  D-decision  is  the  decision  to  check  the  part 
to  see  if  it  has  failed  yet,  with  some  appropriate  nuisance  cost  if  it  hasn't, 
while  the  W-decision  is  the  decision  to  wait  one  more  time  unit.  They  do 
not  attempt  to  treat  the  extension  of  the  problem  to  the  possibility  of  obser* 
ving  some  noisy  signal  associated  with  the  failing  part  (a  temperature 
reading,  for  example).  The  problem  is  defined  in  terms  of  continuous 
times,  so  that  n  is  replaced  by  t  ,  and  equation  (4.  14)  and  equations 
derived  from  it  become  equalities. 

What  is  an  interesting  comment  on  their  work,  however,  is  the 
fact  that  although  they  assume  a  general  form  of  the  arrival  distribution, 
they  never  prove  the  form  of  the  optimal  solution.  What  is  done,  in  effect, 
is  to  assume  that  the  optimal  checking  procedure  will  be  to  wait  some  time 
tj  ,  then  check,  if  no  failure  is  seen,  wait  some  time  »  check,  etc. 
They  go  on  to  show  that  for  the  exponential  failure  time  density  function 
(the  equivalent  of  out  geometric  mass  function)  these  t^  are  equal. 

The  assumption  of  the  (now  proven  correct)  fixed  checking  time  form 
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of  solution  enable*  them  to  imlmediately  write  a  coat  expression  in 
of  the  checking  time  t  ,  which  can  be  minimised  by  differentiation. 

The  dynamic  programming  approach  adopted  in  section  4.  2 
thus  has  the  advantage  of  providing  the  form  of  the  optimal  strategy, 
as  well  as  its  parameters.  In  addition,  as  will  be  seen  below,  we 
are  now  in  a  position  to  simply  extend  the  structure  to  allow  for 
observations  of  a  random  variable  relating  to  the  state  of  nature 
(whether  or  not  it  concerns  a  target  arriving,  or  a  piece  of  failing 
equipment). 


4.  6  Optimal  Search  With  Information 

We  are  now  ready  to  attack  the  problem  involving  observations 
of  a  random  variable  x  with  every  W  decision  First  we  note  that  the 
proof  in  section  4.4  concerning  the  form  of  the  non -informative  search 
may  be  carried  over  conceptually  to  the  more  general  case  represented 
by  equation  (4.4),  so  we  state  without  formal  proof  that  equation  (4.4) 
may  be  written 


V(P)  =  min 


D(P) 


H(P) 


rD(P) 

(P) 


P  fcy 
P  57 


(4.  19) 


where 


D(P)  =  (i-P)(F+  V(A)  ) 


r*>  /pp  (x)  +  (l-P)pQ(x)X 

H(P)  =  pW  +  J  [Ppj  (x)  +  (1-P)p0(x)]  vf  pPi  (x)  +  (l-P)p0(x) 
« so  \ 


dx 


We  are  now  faced  with  a  functional  equation  similar  to  that  treated  in 
Chapter  III.  Unfortunately,  however,  there  is  no  efficient  way  to 
intuitively  truncate  the  search,  as  was  done  in  that  case.  In  spite 
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of  this  lack  of  intuitive  truncation,  it  is  of  course  still  possible  to 
solve  equation  (4. 19)  by  such  an  iterative  procedure.  In  fact,  a  standard 
technique  for  solving  such  functional  equations,  and  many  transcendental 
equations,  is  simply  the  method  of  successive  approximations  used  in 
the  previous  section.  If  convergence  properties  can  be  shown,  then 
any  such  method  is  valid,  despite  the  non-physical  character  of 
intermediate  solutions. 


For  this  reason,  we  shall  re-write  equation  (4.  19)  with  V 
as  a  function  of  an  iteration  index  n  (that  has  no  particular  connection 
to  any  physical  index  of  the  search).  Doing  this  yields 


D  (P)  =  (1-P)(F+  V  .  (X)  ) 
n  n-i 


V  (P)  =  min" 
n 


(4.  20) 
Pp,  (x)+(l-P)p». 


/»*  /Pp  (x)+(l-P)p J 

Hn(P)  =  PW+jfPp,  (x)+(l-P)p0(*)]  v  J 

-00  \ 


All  that  is  needed  now  is  the  selection  of  the  boundary  condition  (P)  , 
and  assurance  that  successive  iterations  will  converge  the  process 
to  V(P)  as  .  If  we  let  Vq  (P)  *  0  for  all  P  ,  then 

Vj  (P)  -  min  [(1  -F)F  ,  P  W]  *  0  =  VQ  (P)  . 

With  the  fact  that  we  have  found  some  V  (P)  &  V  ,  (P)  ,  a  proof 

n  n-i 

very  similar  to  that  leading  to  theorem  B  of  section  3.  3. 1  allows  us 

to  show  that  in  fact  all  V  (P)  £  V  .  (P)  ,  so  that  the  process  will 

n  n  - 1 

approach  V(P)  from  below.  To  complete  the  convergence,  we  need 
to  show  that  V(P)  is  bounded  from  above.  This  can  be  shown  by 
noting  thnt 


V (A)  S  (1  -A)(F  +  V(A)  ) 
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so  that  V(A)  ^  F  and  therefore 

V(P)  ==  (1-P)(F+  V(A)  )  F  (4.21) 

Again,  as  in  Chapter  III,  it  is  easy  to  show  that  7^  , 
the  solution  of 


D  (7  )  =  H  (7  ) 

n  n  n  n 


converges  to  some  limit  7  as  n-^o°  . 

An  example  of  this  iteration  process  is  shown  in  Figure  4.  3, 
where  we  have  selected  the  values  of  the  parameters  to  compare  to 
the  results  of  section  4.4.1.  The  PqM  and  Pj  (x)  are  normal  dis¬ 
tributions  with  unit  variance  and  mean  of  0  and  p  respectively, 
where  p  is  the  signal  to  noise  ratio. 

These  calculations  were  calculated  on  an  IBM  7090.  As  can 
be  seen  from  the  figure,  the  convergence  is  much  slower  than  that  for 
the  problems  of  Chapter  III.  However,  the  computations  are  still 
essentially  additions,  with  an  appropriate  approximation  for  the  integral. 

The  general  solution  of  the  search  for  a  randomly  arriving 
target,  with  observations  of  an  appropriate  signal,  has  thus  been 
obtained,  with  the  minimum  cost  attainable  and  optimal  strategy 
comprising  the  solution.  As  pointed  out  in  a  previous  section,  this 
model  applicable  to  a  randomly  arriving  target  may  also  be  applied 
to  3uch  problems  as  a  randomly  failing  piece  of  equipment. 
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4.6.  1  Numerical  Example  of  Informative  Sequential  Search 

The  defense  unit  of  section  4.4.  1  has  decided  to  install  a 
sonar  apparatus  to  aid  in  the  search  against  the  trawler.  The  results 
of  integrating  sonar  signals  for  the  half-hour  unit  time  is  a  random 
variable  x  that  has  p.d.f.  Pq  (x)  if  the  trawler  has  not  yet  arrived, 
or  p^  (x)  if  the  trawler  has  arrived,  where  p^  (x)  =  f^(x;  0,  1)  , 
pj  (x)  =  f^(x;  1*1}  .  As  indicated  in  section  3.  5. 1,  this  is  the  case 

when  detecting  a  known  signal  .  additive  Gaussian  noise,  with  a 
signal-to-noise  ratio  ji  =  1  . 

From  Figure  4.  3  we  note  that  y  -  •  7$  .  This  is  higher 
than  the  value  of  .673  ,  and  indicates  that  the  availability  of  information 
will  let  the  searcher  be  less  quick  to  respond.  We  also  note  that 
V(0)  =  V(A)  —  .  58  ,  which  is  a  saving  of  close  to  50%  compared  to 
the  ion -informative  search.  The  strategy  that  gives  these  results 
follows. 


Suppose  P  =  A  =  .  1  to  start  the  search.  Since-  1  SS  y  =  .78, 
an  observation  is  required  at  the  first  time  interval.  Suppose  a  value 
x^  is  the  result  of  this  observation. 


The  posteriori  probability  P(x^)  that  the  target  h«.s  arrived 
is  now  given  by  equation  (4.  3) 


(,l)p1(x1)4(,l)(.9)p0(x1) 
P (*!>  “  "T 1)  pjlx'j")  T  {.  9)  P0  <Xj) 


(.l)exp(xrj)  +  .09 
.  1  exp{x1  --)  +  .9 


Comparing  this  with  y  ,  we  see  thAt  i' 


AXp (Xj 


b 


2:  27.  8 
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or 

Xj  5k  .  I  4  in  2?.  S  '*•  5. 9 

then  the  aircraft  should  be  sent  out.  If  not,  another  observation  »2 
should  be  made,  the  a  posteriori  probability  P(Xj  ,  x2)  should  be 
calculated  and  compared  to  y  ,  etc. 

4.  6.  2  A  Comment  on  The  Solution 

As  is  Shown  by  the  example  in  Figure  4.  3,  although  con¬ 
vergence  of  V^O3)  t0  V(P)  is  guaranteed,  the  speed  with  which 
the  process  converges  is  rather  slow.  In  fact,  as  X  gets  very  imah, 
causing  the  cost  of  search  to  increase,  the  convergence  is  even  slower 
This  unfortunate  practical  difficulty  la  at  present  unresolved.  One 
possible  approach  i*  suggested  here. 

We  decided  in  section  4. 6  to  start  the  iteration  with  (P)  = 
which  consequently  assures  convergence  from  below.  It  is  equally 
possible  to  start  the  iteration  at  come  appropriate  large  value,  which 
will  again  assure  convergence,  but  than  from  above.  One  such  value 
would  be  the  right  hand  eide  of  the  condition  given  in  equation  (4.  21), 
i.e.  Vq  (P)  *  jj*  .  However,  a  lower  starting  point  is  available  by 
noting  that  the  minimum  cost  o'J  the  informative  search  is  less  than 
or  equal  to  tha  minimum  cost  of  tha  non -informative  search,  for  all 
values  of  P  .  This  lbwer  starting  value  of  (P)  could  considerably 
decrease  the  number  of  iterations  needed  to  provide  a  given  degree  of 
accuracy. 

Other  techniques  for  sstablishing  a  reasonable  first  guess 
of  V  iF)  ,  and  Jetting  this  equal  Vq  (P)  ,  Would  be  a  valuable  . 
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aid  in  the  computation.  In  general,  however,  convergence  proofs  might 
be  difficult  for  arbitrary  starting  functions  (P)  . 

4.  7  Implementation  of  the  Strategy  and  Comments  On  the  Geometric 

Arrival  Assumption 

In  Chapter  III  we  showed  that  the  use  of  the  optimal  search 
strategy  resulted  in  essentially  a  Wald  sprt,  where  the  decision  boundaries 
were  determined  by  cost  considerations  rather  than  by  error  probabilities, 
A  similar  analysis  of  the  implementation  of  the  strategy  developed  in 
section  4.6  is  of  interest,  in  that  it  points  out  a  basic  limitation  to  the 
treatment  ~>f  the  problem. 

From  the  form  of  the  decision  structure  presented  in  equation 
(4.  18),  we  see  that  P  ,  the  probability  that  the  target  has  arrived  up  to 
some  time,  is  constantly  compared  to  some  decision,  threshold  y  .  When 
a  D-decision  occurs,  P  automatically  returns  to  0  if  the  target  has  not 
yet  arrived.  With  a  series  of  W-decisions,  however,  a  series  of  obser¬ 
vations  .  .  .  has  been  made,  and  the  posterior  probability  of  the 

target  having  arrived  can  be  derived. 

Specifically,  let  us  consider  the  search  to  start  with  a 
W-decision  at  t  =  1  ,  and  that  n  successive  observations  of 
x^.x^,  . .  .  ,xft  are  made.  We  shall  also  consider  a  completely  general 
arrival  time  distribution  f(t),  t=l,2,  ...  (we  define  f  (0)  =  0)  .  Using 
the  statement  of  conditional  probability,  where  x  -  (x^x^,  •  •  •  >  xn)  , 
we  define 

P  «prob.{t  Sn|x)  =  ESfeJtlgd  f°b'  . 

n  prob.  {x} 
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The  unconditional  probability  of  receiving  some  vector  x  is 

n  n  n 

prob.  {x}  =  f  ( 1)  t5l  P1  ^xi  >  *  *  ^  p0  *Xi  *  i=2  P1  ^  *  {  ^  po  *X1  *  p0  *x2*  i?3  PJ 

♦  ••  •  *w(ft  p0 (Xj))  p,  <«„>  +£*«) (jSi  p0i*i>) 

'  7  j=n+  '  7 

ti  f'\  \  /  n  /  n 

-f  “I  &  h  <*i> + !-'<»(&  po  «»,>)  (lj  Pj  'v)  +If  (& p,  '• 

j=2  '  7  X  7  j=n+l  X 


(4.  22) 


Similarly,  we  may  calculate 


n  V  /j"1  \/  n 

prob.  {x|t2Sn}  •  prob.  {t:Sn}  =  f  (1)  Pi  +  i5l  Po  ^iVl  l5j  P1 


(4.  23) 


The  condition  for  a  D-decision,  given  that  x  h*s  been  observed,  is 
that  2  7  .  Using  the  above  expresrions  this  condition  becomes 

<*i> p« <v) fc pi  «v) a  £'<j) ft  po ft  • 

j*2  >  '  \  7  j*n+l  X  7 

n 

Dividing  by  JjJj  pQ  (x^)  givos  the  condition 


j*l  \  •'  j«l  \  /  jsn+1 


(4.  24) 


where  I;  (x^)  is  the  likelihood  ratio  for  each  observed  . 

We  note  now  an  interesting  relation  between  this  search  and 
the  one  developed  in  Chapter  III,  and  described  in  section  3  4.  If  we 
assume  that  f(t)  is  such  that 

f (1)  =  P 

f(j)  =0  j  <  » 

f{«)  •-  1  -  P 


(that  is,  the  target  either  arrives  at  t=i  with  probability  P  ,  or  it 
never  arrives  so  that  f(°°)  =  1  -P)  ,  then  the  condition  (4. £.4)  becomes 


1  -y 


1_-P 

P" 


which  is  precisely  the  condition  needed  in  the  sprt  to  reach  a 
decision.  We  see  then  that  we  have  not  only  a  sprt,  btst  one  with 
no  lower  decision  point,  which  implies  that  Wq  =  0  (see  the  proof 
of  theorem  D  and  the  definition  of  P).  We  have  shown  then  that  a 
certain  class  of  the  problems  described  by  equation  (3.  2),  namely 
those  for  which  Wq  =  0  ,  are  imbedded  in  the  general  solution  uf 
equation  (4.  4). 


Returning  to  the  problem  at  hand,  in  the  general  case 

represented  by  equation  (4,  24),  we  can  no  longer  describe  the  test 

that  develops  as  a  simple  random-walk.  In  fact  the  equation  repre- 

n 

sents  a  very  complicated  process.  Not  only  is  the  ,n  JL  (xj  term 
weighted  by  the  f  (j)  ,  thiB  weighting  is  successively  compared  to 
a  term  which  gets  smaller  as  n  increases.  Since  the  simple 
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random -walk  with  constant  absorbing  barriers  has  not  been  fully  solved 
(see  Appendix  A),  there  is  no  reason  to  believe  that  this  non -Markovian 
(because  is  more  than  just  a  function  of  process,  with  non¬ 

constant  barrier  would  be  any  easier.  Thus  a  strictly  Wald-type  approaci 
depending  as  it  does  upon  the  statistics  of  such  a  process,  would  not  seei» 
too  profitable. 

U  we  let  f  (t)  =  X  (l-A)*  *  (t  *  1,2,...},  however,  an  interest*  a. 
result  is  shown.  Equation  (4.  24)  becomes 


^XO-X)1'1  .|mx.)  !l-X)n 

j=l  j-n+1 


and  by  defining  A(x^)  =  L>(xJ/ (1-X)  we  get 


Is j  A(V 


0-7)  A 


which  has  the  advantage  of  being  a  test  that  compares  a  variable 
n 

Zn  *  £  Si  A(xi> 
i-1 

to  a  constant  decision  threshold. 

2n  addition,  the.  sequence  describes  a  Markov  process  in 

that  Z  . ,  only  depends  upon  Z  (as  well  as  A(x  ,  of  course).  To 
nr  i  n  n+ 1 

•how  this  we  note  that 


Zn+1  *  ZnA(xn+l>  +  A(xn+1> 


which  can  be  verified  by  direct  substitution  into  the  definition  of  Z  , 

tl 
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The  geometric  distribution  of  arrival  times  thus  imparts 
a  Markov  character  to  the  decision  process.  And  indeed,  it  is  just 
this  character  that  has  allowed  us  to  approach  the  problem  from  the 
dynamic  programming  point  of  view.  By  allowing  the  argument  of  V 
to  be  P  (and  similarly  the  argument  of  C  in  Chapter  III),  we  have 
been  assuming  that  P  is  completely  descriptive  of  the  searcher's 
state  of  knowledge  about  the  system,  and  that  the  history  of  events 
that  lead  to  P  are  unimportant.  Conversely,  since  for  the  general 
arrival  time  distribution  it  cannot  be  shown  that  equation  (4.  24)  re¬ 
presents  a  Markov  process,  we  cannot  write  a  general  equation 
similar  to  equation  (4.4)  as  P  alone  is  not  sufficient  to  represent 
a  ’'state11  of  the  process. 

4.  8  Fixed  Threshold  Sequential  inarch 

As  a  prelude  to  the  next  sections,  we  shall  consider  here 
the  optimal  search  with  the  use  of  threshold  observations  defined  in 
section  3.6.  Again  we  shall  consider  the  comparison  of  x  to  some 
fixed  threshold  x*  which  remains  constant  throughout  the  search. 
Defining,  as  before 

00 

f  B  f  (x*)  =  jpQ  (x)  dx 

X* 

d  =  d(x#)  =  \p.  (x)dx 

'J  1 

X* 

we  may  write  an  equation  similar  to  equation  (4.4),  with  R(P)  here 
representing  the  minimum  expected  cost  obtained  by  using  the  optimal 
fixed  threshold  sequential  strategy,  and  the  observations  consist  of 
V  =  1  (xetx*)  and  y  =  0  {x^x*)  . 
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(1-P)(F+R(A)) 


R(P)  =  min 


PW  *  [p  d  +  u  -P)  £1 R  f  i^Vjnpff) + 


+  [P(l-d)  ♦  (l-P){l-f)]R 


P(I-.d)  +  (1  -P)(l  -f)  A 
Pd  +  (l-P)(l-f) 


(4.  25) 


Aa  in  the  previous  sections  we  shall  compute  R  (P)  =  Rf  (P) 
using  the  equation 


R  (P)  =  min 
n 


A  (P)  A  (P) 
n  n 


B  (P)  B  (P) 
n  V  n 


P  ^  7 
P  £  7 


(4.  26) 


where 


A  (P)  a  (1-P)(F  +  R  .(A)) 
n  n-i 


»n(F)  - 


pw t  [Pd  +  (* -P) £]  Rn.i 


and 


VP>“0  . 

Again,  we  note  that  R(P)  is  a  function  of  x*  through  d 
and  f  ,  and  so  the  final  optimisation  takes  place  for  each  P  (and,,  in 
particular,  for  P*l)  such  that 

Rm,„<p)  =  ?in[R<p)]  . 

mm  x* 
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An  example  of  such  a  calculation  is  shown  in  Figure*  4.4,  4.  5, 

and  4.6  with  values  of  W  ,  F  ,  X  and  similar  to  those  used  in  previous 

examples  for  the  no  observation  case,  and  for  the  continuous  x  case. 

Figure  4.4  shows  the  convergence  of  R^tP)  to  R  (P)  for  a  particular 

x  value.  Figure  4.  5  is  a  plot  of  R{A)  as  a  function  of  x*  (and  thus 

a  function  of  d),  and  Figure  4.  6  is  a  plot  of  R  .  (P)  as  a  function  of  P  . 

min 


4.  8.  1  Numerical  Example  of  Fixed  Threshold  Sequential  Search 

We  return  to  the  search  for  the  trawler  of  the  previous  example, 
and  assume  that  the  sonar  device  has  a  built-in  threshold  which  must  be 
set  permanently  before  the  search  starts.  Given  the  same  cost  para¬ 
meters  as  before,  we  see  from  Figure  4.  5  that  if  the  search  starts  with 
P  =  A  the  threshold  x*  must  be  set  to  equal  x*  =  .88  so  that  d  =  .  55  , 
f  =  .19  .  Using  this  value  of  the  threshold  we  see  from  Figure  4.6  that 
7  =  .75  and  V(A)  =  .  67  ,  whicl  is  a  higher  cost  than  the  search  with 
observation  of  continuous  x  ,  but  a  lower  cost  than  obtained  in  the  non- 
informative  search.  These  other,  two  costs  are  also  illustrated  by  the 
other  lines  in  Figure  4.6. 


The  fixed  threshold  strategy  that  results  is  as  follows.  Suppose 
that  the  search  starts  with  P  =  A  .  Since  A  =  .  1  <  7  =  .  75  ,  an  observa¬ 
tion  y^  should  be  taken.  The  two  possible  output  results,  are  yj  -  0 
(x  S  x*)  and  y^  =  1  (x  2:  x*)  ,  which  produce  the  a  posteriori  probabilities, 


(,l)(l-d)  +  (,l)(.9)(l-f)  _ 
(•  1)(1  -d)  +  (.  ,9)(1  -f) 


=  0 


P 


(,  l)d  +  (,  1)(,  9) f 
{.  l)d  +  (•  9) f 


=  .  32 


yl  s  1 
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.ation 


H - 1 - - 1 - L~f 

.4  .6 

le  of  R  .  (P)  ,  compared  with 
min 

hout  information 


Since  both  these  values  are  less  than  y  -  *  75  ,  another  measureme 
should  be  taken.  Now  the  a  posteriori  probability  depends  upon  the 
sequence 


(.  1 5)(1  -d)  +  (.!)(.  85)(1 -f)  _  l8 
(.  15)(1  -d)  +  (.  85)(1  -f)  '  “ 


(y1>  y2)  ~ 


(.  15)d  +  (.  1)(.  85)  f 
(.  15)  d  +  (.  85)  f 


P  =< 


l 


{.  32)(1  -d)  +  (.!)(.  68)(l-f) 
(,  32)(l-d)  +  (.  68)  (1  -f) 


(.  32)  d  4-  (.  1)(.  68)  f 
(.  32)  d  +  (.  68)  f 


.28 


61 


<yry2}  =  (°,i. 


(yl  -  Y2)-  =  (!» 


= (1 


which  are  all  still  less  than  .  75  .  However,  we  see  that  the  observ 
with  y2  =  1  (i.e.  (0,  1)  and  (1,  1)  )  have  a  higher  P.  We  would  ex- 
that  in  these  cases  a  y^  =  1  observation  would  result  in  P  >  .  75  , 
in  fact  this  is  so.  Thus  the  sequential  strategy  develops  sequences  t 
observations  which  lead  eventually  to  the  decision  to  send  the  aircra 


4. 9  Analysis  of  Some  Non- Sequential  and  Non-Optimal  Decisio 

itules;  Practical  Considerations 

Although  the  threshold  search  process  of  the  previous  sect 
involves  a  simpler  equation  than  the  one  where  continuous  x  is  me  a 
the  implementation  still  require  *  calculation  of  a  weighted  sum  of  lb 
lihood  ratios  and  comparison  with  a  threshold,  just  as  in  the  earlier 
sections.  For  this  reason,  the  required  decision  making  demands  a 
sophisticated  arithmetic  capability,  a  condition  often  lacking  in  pract 
situations. 


A  study  of  some  non- sequential  (and  therefore  non -optimal) 
decision  rules  that  have  the  advantage  of  being  simple  to  use  are  now 
considered  using  the  model  of  a  randomly  arriving  target  derived  in 
this  chapter.  In  particular,  we  would  like  to  evaluate  some  of  the 
'classical"  rules  under  the  condition  that,  although  they  were  designed 
for  hypothesis  testing,  they  are  applied  to  the  randomly  arriving  target. 

The  non -sequential  STSD  rules  of  section  (2.4)  are  of  particular 
interest.  Suppose  that  they  are  used  in  repeated  situations,  so  that  at 
every  time  interval  decision  or  Dq  is  made,  but  that  the  probabi¬ 
lity  P  of  "target  present"  is  always  the  same  at  each  situation.  We 
consider  the  search  to  continue  until  the  correct  decision  {D^  (s^} 
is  made,  at  which  time  it  terminates.  If  we  wish  to  consider  this 
within  the  structure  of  the  randomly  arriving  target  model,  we  see 
that  it  is  the  equivalent  of  assuming  the  rule: 

for  any  x,  ,  if  x.  ix*  :  D 
1  i 

*{  :s  x*  :  W 

or,  in  words,  take  action  on  the  first  measurement  that  exceeds  some 
fixed  threshold  x*  .  Let  us  investigate  how  x*  is  obtained. 

With  STSD,  C  (P)  ,  the  cost  per  decision  is  minimized,  to 
get  x*  ,  where 

C (P)  «O-P)C10f+PC01(l-d)  (4.27) 

(from  equation  (2.  4),  with  f  ■  pf  ,  d  ■  pd  ,  =  0). 

The  cost  structure  of  the  randomly  arriving  target,  however, 
requires  a  minimization  of  the  cost  of  the  entire  search  V(P)  .  The 
cost  of  search,  given  that  the  target  arrives  at  time,  t  and  using  the 
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above  strategy?  consists  of  two  terms.  The  first  is  the  cost  of  false 
decisions  that  occur  until  t  ,  an  average  of  f  per  unit  time.  The 
second  term  is  due  to  the  expected  time  t  needed  to  make  the  correct 
^-decision  after  the  target  arrives,  which  is 


00 


T  =(1 


The  total  cost  V(t)  given  arrival  time  t  is  thus 

V  (t)  =  f  Ft  +—••'.  , 

a  - 

and  the  average  total  cost  V  is 
00 

V  =  £v(t)p(t)  =  fFt  +  \. 

t=0 

where  p(t)  is  the  arrival  time  distribution.  By  saying  that  P  is  the 

same  for  each  observation  we  have  assumed  that  p(t)  =  Ftl-P^  ,  so 
1-P 

that  t  =  — p-  and 

V(P)  =  V  =  f  F(~£)  +  ’  \  '  (4.  28) 

*  a 

We  have  seen  before  that  the  operating  point  when  the  cost 
is  expressed  by  equation  (4.  27)  is  defined  by  the  point  on  the  ROC 
where 


d(d) 

df 


1-P 

P 


By  minimising  the  cost  assumed  in  equation  (4.28).,  however, 
we  have  for  an  operating  point 


d  (d)  2  UP  F 

df  "  P  W 


(4.  29) 
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If  we  observe  that  F  and  W  are  the  exact  analogues  of  the  costs 
and  Cqj  respectively  (that  is  =Es  cost  of  {d|Sq}  ,  and  Cqj  =  W  = 

cost  of  {w|s^}  )  we  see  that  the  operating  point  expressed  by  equation 
(4.29)  required  a  smaller  value  of  d(d)/df  on  the  ROC,  and  so  operation 
ate  higher  d  and  f  .  Seer  for  example,  Figure  2.2. 

What  is  most  interesting,  however,  is  that  for  the  randomly 
arriving  target,  the  two  variables  of  detection  that  enter  into  the  cost 
of  search  are  the  probability  per  unit  time  (or  "rate  ")  of  making  false 
alarms  and  the  expected  time  until  detection  after  the  target  arrives. 

For  the  simple'  decision  rule  analysed  above,  the  false  alarm  rate  was 
simply  equal  to  the  false  indication  probability  f  ,  since  only  one  indi¬ 
cation  was  required  for  a  D-decision.  Similarly,  r  ,  the  average  time 
until  detection  after  the  target  arrives  is  simply  related  to  d  since  t  , 
the  time  until  detection  after  the  target  arrives  is  geometrically  diotri- 
buted  with  parameter  d  ,  and  hence  T  =  (~ ).  In  ghneral,  however,*  , 
we  eee  that  for  the  model  of  the  randomly  arriving  target  we  can  charac¬ 
terise  the  search  system  (detection  device  plus  a  decision  rule)  by 
means  of  a  couple  (♦,  t)  ,  where  $  is  now  defined  as  the  false  alarm 
rats.  We  recall  that  a  falee  alarm  ie  the  decision  to  decide  the  target 
has  arrived  when  in  fact  it  has  not. 

This  characterisation  of  a  system  by  ($;  r)  will  be  shown  to 
be  entirely  analagous  to  the  characterisation  of  a  detection  device  by 
the  (Pj,  pg)  couple  defining  the  ROC,  as  in  Chapter  II. 

4, 10  The  System  Operating  Characteristic  (SOC) 

Just  as  in  STSD  the  receiver  operating  characteristic  (ROC) 
was  a  useful  concept  in  compering  and  evaluating  detection  devices, 

(sse  ssetion  2.  3)  a  claim  is  now  made  for  the  use  of  a  similar  concept 
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for  the  comparison  and  evaluation  of  certain  search  systems.  By  search 
system  we  mean  a  detection  device  used  in  conjunction  with  a  decision 
rule.  Although  the  ROC  can  be  used  to  compare  search  systems,  it  can 
do  so  only  if  they  conform  to  the  same  set  of  decision  rules,  which  in 
turn  lead  to  some  equivalent  p^  and  p^  . 

The  System  Operating  Chara  t  fistic  (SOC)  that  is  proposed 
here  is  simply  a  plot  of  t  v.  s.  <j>  for  any  search  system.  By  $  we 
now  meii  that  fraction  of  time  that  the  system  produces  the  wrong 
decision  {d|s^}  ,  and  by  T  we  mean  simply  the  expected  time  to 
reach  the  decision  (d|s  }  after  the  arrival  of  the  target.  For  the 
randomly  arriving  target  model  the  SOC  may  be  used  for  both  qualitative 
and  quantitative  comparison  of  different  systems,  just  as  the  ROC  was 
used  to  compare  detection  devices  for  the  stationary  target  model. 

For  example  of  the  qualitative  aspects,  let  us  consider  two 
systems  and  S2  with  operating  points  (r^  ,  4^)  and  (t^  ,  <i>2)  • 

If  t.  <  t_  and  <}>,  <  <|>,  ,  .then  S,  is  preferable  to  S_  ,  and  so  the 
direction  of  preference  on  the  SOC  is  '/down  and  to  the  left'1.  In  order 
to  obtain  a  SOC  it  is  of  course  necessary  to  give  the  decision  rule  as 
well  as  the  characteristics  of  the  detection  device.  The  SOC's  in 
Figure  4.  7  are  the  equivalent  of  the  ROC's  given  in  Figure  2.  2,  i.  e. 
for  the  detection  of  a  known  signal  in  additive  Gaussian  noise,  (with 
p  the  signal  to  noise  ratio),  and  the  decision  rule:  if  x  2:  x*  :  , 

if  x  ^  x*  :  Dq  at  every  observation.  Note  that  an  increase  in  p  is 
still  a  universal  improvement. 

Once  the  SOC  is  given,  the  optimal  operating  point  for  any 
decision  criterion  (not  only  the  "Bayes")  may  be  obtained.  As  shown 
in  the  previous  section,  the  cost  of  search  may  be  written  in  general  as 

V  =  <j>Ft  +  TTt  (4.30) 
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so  that  the  minimum-cost  (VBayes1^  operating  point  is  where 
d'f  Ft 

d<*>  ”  *  ir 

with  t  the  mean  arrival  time  of  the  target. 

If  the  Neyman- Pear  son  criterion  is  usei,  some  arbitrary 
maximum  allowable  false  alarm  rate  =  4>*  is  ixed,  and  the  . .  ' 
operating  point  determined  by  the  SOC  at  the  poi  t  4*  =  4>*!t  •  The 
use  of  the  SOC  for  this  criterion  is  very  well  illi  strated  by  the 
following  example.  Suppose  that  the  decision  rule  is  to  make  decision 
I>  only  when  there  are  indications  (x  2r  x*)  at  k  successive  times, 
and  the  detection  device  is  for  the  familiar  known  target  in  additive 
Gaussian  noise. 
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For  any  fixed  k  and  x*  there  result*  a  value  of  9  and  r  ,  and  by 
letting  x*  vary  from  minua  infinity  to  infinity  we  produce  the  SOC  for 
the  k-tn-a-row  system.  These  SOC's  are  shown  in  Figure  4.  8  for 
k  a  1  (the  STJSD  case),  k  =  2  ,  k  =  3  with  p  =  1  .  As  can  be  seen, 
the  value  of  k  to  be  used  depends  upon  where  in  the  r  -  $  plane 
the  operating  point  is  located. 

For  the  Neyman-Pearson  criterion,  it  is  often  useful  to 
present  ROC  as  a  plot  of  P^  .  V..  a.  p  for  the  given  desired  p*  . 
Similarly  we  may  draw  the  SOC  for  the  above  example,  for  a  given 

4*  ,  as  a  plot  of  r  v.  s.  p  ,  or  ,  more  convenient,  as  a  plot  of 

—  -8 
1/t  v.  s.  p  .  Figure  4.9  shows  such  a  plot  fcr  a  fixed  «J>*  =  10 

(This  value  of  critical  false  alarm  rate  seems  to  be  a  popular  one 

for  radar  search  system*.)  We  note  that  as  p  becomes  large  T~*k 

(■which  is  reasonable  since  it  still  requires  k  measurements  before 

a  D-decision  can  be  made)  and  the  k  =  1  rule  is  best.  However, 

as  p  becomes  small,  it  becomes  advantageous  to  use  the  2-in-a-row 

rule,  then  the  3-in-a-row  rule,  and  so  on. 
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CHAPTER  V 


CONCLUSION 


5.  1  Summary  of  Results 

The  objective  of  this  study  has  been  to  obtain  a  set  of  decision 
rules  for  use  in  certain  general  search  situations.  These  rules,  called 
optimal  sequential  search  strategies,  are  "optimal"  in  that  their  use 
insures  that  the  total  cost  of  search  is  as  small  as  possible.  They  are 
"sequential”  in  the  sense  that  at  any  point  in  the  search,  the  decision  rules 
are  affected  by  what  has  been  observed  up  to  that  point. 

The  principle  technique  employed  has  been  the  representation 
of  the  search  process  in  terms  of  a  stochastic  dynamic  program.  This 
method  not  only  provides  the  form  of  the  optimal  strategy,  it  also  produces 
as  a  natural  consequence  the  minimum  cost  attained  by  using  6uch  a  strategy. 
This  allows  an  immediate  quantitative  comparison  of  the  results  with  any 
non-optimal  strategy  for  which  the  cost  is  available. 

The  models  that  were  treated  involved  the  use  of  detection  devices 
that  are  imperfect,  in  the  sense  that  a  target  that  is  present  might  go  unde¬ 
tected,  or  a  "detection"  might  occur  when  the  target  is  in  fact  not  present. 
The  latter  error  is  referred  to  as  a  false  alarm,  and  it*  consideration  is 
what  distinguishes  the  present  work  from  most  of  the  published  literature 
in  search  theory. 

5.  1.  1  Optimal  Sequential  Search  for  a  Stationary  Target 

In  this  model,  discussed  in  Chapter  III,  the  target  is  considered 
to  be,  for  all  time,  present  in  the  region  of  search  with  probability  P  , 
or  not  present  with  probability  (1-P)  .  Knowing  the  value  of  P  ,  at  fixed 
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intervals  of  time  the  searcher  must  either  make  a  terminal  decision  or 
make  a  measurement  of  a  random  variable.  The  distribution  function  of 
this  random  variable  depends  upon  whether  or  not  the  target  is  actually 
present.  The  two  terminal  decisions  involve  taking  action  corresponding 
to  the  conclusion  that  the  target  is  present,  or  that  the  target  is  absent. 
The  search  it  then  ended,  and  a  cost  is  incurred  if  the  conclusion  reached 
is  wrong.  If  a  measurement  is  taken,  a  cost  (which  depends  upon  whether 
or  not  the  target  is  present)  is  incurred.  The  measured  random  variable, 
however,  also  depends  upon  whether  or  not  the  target  is  present,  so 
information  about  the  target's  presence  is  gained  from  every  measurement 

Tie  optimal  strategy  (which  decisions  to  make  on  the  basis  of 
a  knowledge  of  P  at  the  start,  plus  possible  subsequent  measurements) 
and  minimum  cost  were  then  obtained  by  the  solution  of  a  functional 
equation.  This  functional  equation  may  be  solved  numerically  by  a 
method  of  successive  approximations  which  is  equivalent  to  treating 
the  equation  as  a  dynamic  program.  The  state  variable  of  this  program 
is  the  probability  that  the  target  is  present,  which  is  adjusted  by  Bayes' 
Rule  after  every  observation. 

Proof  of  the  convergence  of  such  a  program,  and  the  general 
form  of  the  strategy  that  arises  were  developed  and  examples  calculated 
for  interesting  cases.  In  particular,  it  was  shown  that  the  optimal  se¬ 
quential  strategy  is  similar  to  the  Wald  sequential  probability  ratio  test 
(sprt).  The  sprt,  however,  involves  the  a  and  p  errors  of  classical 
statistics  and  may  be  only  approximately  derived,  while  the  new  result 
offers  the  minimum  cost  directly,  and  provides  a  solution  to  any  desired 
degree  of  accuracy.  In  addition,  if  the  search  must  be  truncated  (stopped 
after  some  given  length  of  time)  the  dynamic  programming  solution  still 
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offers,  by  definition,  the  minimum  cost  search,  whereas  sprt  truncation 
solutions  often  are  quite  arbitrary.  The  results  were  also  compared 
with  the  minimum  cost  non-sequential  decision  rule  that  is  a  part  of 
the  Statistical  Theory  of  Signal  Detection. 

The  method  of  solution  also  facilitates  the  evaluation  of  certain 
non-optimal  decision  rules.  In  particular,  we  have  analysed  the  situation 
where  observation  of  a  continuous  random  variable  is  restricted  to  noting 
whether  or  not  it  exceeds  a  fixed  threshold.  The  extra  cost  due  to  this 
quantization  of  the  observation  was  then  obtained. 

The  method  developed  here  for  treating  this  problem  is  in  fact 
applicable  to  any  hypothesis  testing  situation,  where  the  cost  of  experi¬ 
mentation  depends  upon  the  state  of  nature,  and  the  a  priori  hypothesis 
probability  and  the  terminal  decision  error  costs  are  known. 

5.1.2  Optimal  Sequential  Search  for  a  Target  Arriving  at  a 

Random  Time'  ’ 

Chapter  IV  treated  a  search  situation  that  is  often  referred 
to  as  the  ’raid  recognition"  problem.  At  the  start  of  the  search,  the 
target  is  present  in  the  region  of  interest  with  an  a  priori  probability 

N 

P  .  At  successive  constant  time  intervals  thereafter,  if  the  target  is 
not  yet  present,  it  has  probability  A  of  arriving.  Once  the  target 
arrives  in  the  region,  it  remains  for  the  rest  of  the  search.  At  every 
time  interval,  the  searcher  must  either  decide  to  take  action  commen¬ 
surate  with  the  conclusion  that  the  target  has  arrived,  or  make  an 
observation  of  a  random  variable  which  has  a  distribution  that  depends 
upon  whether  or  no  the  target  has  in  fact  arrived.  If  it  is  concluded 
that  the  target  has  arrived,  and  it  hasn't,  the  searcher- is  so  informed 
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and  a  falsa  alarm  cost  is  incurred.  Altar  the  target  has  arrived,  a  cost 
is  incurred  that  is  proprotional  to  the  length  of  time  it  takes  to  conclude 
that  it  has  arrived. 

t 

The  optimal  sequential  strategy  and  resulting  minimum  cost 
ware  again  obtained  from  the  solution  of  a  functional  equation  of  the 
dynamic  programming  type .  The  state  variable  of  the  program  is  the 
probability  that  the  target  has  already  arrived.  This  is  adjusted  by 
Hayes'  Rule  after  every  observation,  or  set  to  aero  if  a  "target  has 
arrived"  conclusion  is  made  and  the  target  has  in  fact  not  yet  arrived. 

Comparison  between  Ihe  optimal  sequential  strategy  and  some 
non-optimal  strategies  was  made,  and  the  difference  between  them 
discussed. 

A  side  result  of  the  solution  has  been  the  development  of  what 
is  called  a  System  Operating  Characteristic.  This  is  simply  a  plot  of 
time  until  detection  (given  the  target  has  arrived)  against  false  alarm 
rate  for  any  search  system  (search  strategy  plus  detection  device). 

Its  use  is  analogous  in  the  randomly  arriving  target  search  to  the  use 
of  the  Receiver  Operating  Characteristic  in  the  hypothesis  test  search, 
in  that  it  allows  both  a  qualitative  and  quantitive  comparison  of  various 
detection  devices  and  decision  rules. 

R  has  also  been  noted  that  the  solution  is  valid  for  a  larger 
class  of  problems  than  simply  those  of  search.  For  example,  the 
method  developed  here  may  be  used  to  derive  optimal  checkins  stra¬ 
tegies  for  machinery  subject  to  random  failure  while  some  pertinent 
output  parameter  random  variable)  is  being  monitored. 
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5.  1.  3  Comments  on  the  Solutions ---Need  for  Further  Analysis 

The  major  results  of  this  study  have  been  the  formulation  of  the 
search  models  in  a  way  that  takes  into  account  their  intrinsic  sequential 
qualities.  Due  to  the  dynamic  program  form  of  the  resulting  equations, 
analytic  solutions  {except  for  the  non -informative  random  target  arrival 
time  search)  are  not  attainable.  An  additional  analytical  difficulty  has 
been  the  existence  of  the  (reasonable)  assumption  that  the  observed 
random  variables  are  normally  distributed.  Although  the  solutions 
can  be  obtained  numerically  for  any  set  of  parameters,  it  would  be 
interesting  to  obtain,  at  least  approximate  solutions  for  certain  limiting 
cases.  There  are  two  of  these  in  particular  that  the  author  has 
attempted,  with  (to  date)  little  results. 

The  first  concerns  the  limiting  behavior  of  C  (P)  ,  the 
minimum  cost  of  sequential  search  for  the  stationary  target.  In 
many  practical  cases,  P  is  very  close  to  zero,  while  the  false 
alarm  cost  is  very  much  greater  than  the  missed  target  cost 
(C1A»C_.)  .  How  does  C  (P)  vary  with  u  in  this  limit,  and  how 
does  the  strategy  vary? 

The  second  problem  has  to  do  with  the  very  slow  convergence 
of  the  calculation  of  V (P)  ,  the  minimum  cost  for  the  randomly  arriving 
target  search,  when  X  ,  the  probability  of  arrival  per  unit  time,  be¬ 
comes  small.  If  it  were  possible  to  start  the  iteration  at  come  reason¬ 
able  guess  for  V(P)  ,  the  convergence  would  become  that  much  quicker. 
The  problem  of  interest,  then,  is  to  obtain  some  reasonable  approximation 
for  V (P)  as  a  function  of  p  as  A-*0  .  In  particular,  an  approximation 
for  V(0)  =  V(A)  would  be  useful. 


In.  general,  any  results  that  can  produce  an  analytic  expression 
for  V(P)  and  C  (P)  ,  or  the  decision  regions  (7,  6)  and  ( 0,7 )  defining 
the  strategies,  for  any  (non-degenerate)  limiting  case,  would  be  worthwhile. 

5.  2  Suggested  Areas  of  Further  Research 

The  subject  of  sequential  estimation  as  an  element  of  search  was 
quickly  mentioned  and  dropped  in  Chapter  II.  This  very  difficult  topic  is 
related  to  the  problem  of  parameterization  of  Markov  processes,  one 
that  has  been  recently  attacked  by  Kramer  (28),  Using  his  results,  it 
might  be  possible  to  analyse  the  following  sort  of  problem.  A  stationary 
target  is  being  searched  for  as  in  Chapter  III.  The  signal -to -noise  ratio 
p  is  uncertain,  however,  and  is  in  fact  a  random  variable  with  (say)  a 
known  p.d.f.  g(p)  .  If  a  decision  is  made  that  the  target  is  present, 
then  an  estimate  of  p  must  accompany  this  decision.  An  appropriate 
cost  of  wrong  estimation  is  assumed.  What  is  the  best  sequential  strategy 
including  a  rule  for  estimating  p  ?  This  problem  could  be  set  up  in  a 
dynamic  programming  fashion  if  the  posteriori  p.  d.  f.  on  p  ,  given  an 
observation  x  ,  has  the  same  functional  formas  g(p)  .  However,  this 
rarely  is  the  case,  and  approximate  techniques  (such  as  those  developed 
by  Kramer)  are  needed.  It  is  interesting  to  note  that  some  approximate 
sequential  estimation  techniques,  based  upon  fiducial  probability  arguments, 
have  been  derived  for  this  sort  of  problem  (see  Turner  (44)  ). 

Although  the  Randomly  Arriving  Target  model  was  created  to 
represent  a  more  realistic  type  of  search,  it  too  has  a  basically  weak 
assumption — that  after  arriving  the  targ'  ^  e mains  in  the  region  for 
the  rest  of  the  search.  In  fact,  in  many  realistic  search  situations  the 
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target  is  continuously  appearing  and  dissapearing,  often  "at  random". 
Kimball  (25)  and  others  have  formulated  models  reflecting  this  effect, 
but  there  has  been  no  analysis  taking  into  consideration  false  alarms. 

A  related  problem  hac  been  studied  by  Drake  (12),  with  the  emphasis 
on  the  information-theoretical  aspects  of  the  "noisy"  observations  of 
such  an  appearing  -disappearing  model.  A  decision-theory  approach 
to  this  sort  of  situation  would  prove  fruitful  in  such  fields  as  submarine 
search,  detection  of  epidemics,  etc. 

A  final  suggestion  for  further  work  concerns  what  is  probably 
the  most  well  known  of  results  in  search  theory,  Koopman's  (26) 
solution  for  the  distribution  of  search  effort  over  a  continuous  field. 
Here,  the  target  location  is  represented  by  a  p.  d.f.  (on  a  line,  say), 
and  the  searcher  must  allocate.a  fbed  quantity  of  search  effort  along 
the  line  such  that  the  overall  probability  of  detection  of  the  tar~“t 
a  maximum.  The  detection  probability  at  any  point  x  is  assumed  to 
be  an  increasing  function  of  the  search  effort  placed  between  x  and 
x  +  dx  .  Suppose  now  that  there  is  in  addition  a  false  alarm  probability 
at  every  x  that  is  independent  of  the  target  behavior,  but  a  function  of 
the  search  effort  at  x  .  With  an  appropriate  cost  structure,  what 
should  the  distribution  of  search  effort  be  ?  The  author  has  pointed 
out  in  an  earlier  paper  (37)  that  the  no-false -alarm  solution  to  the 
discrete -cell  search  approaches  Koopman's  solution  in  the  limit. 

It  would  be  interesting  to  see  the  relation  between  the  results  in  the 
present  work  and  a  solution  to  the  problem  above. 


APPENDICES 


A.  Random  Walk  With  Absorbing  Barriers 

We  mention  here  some  results  concerning  the  one -dimensional 
random  walk  with  absorbing  barriers.  In  section  2.  5  we  noted  the  relation 

t 

between  the  sprt  and  this  problem.  Specifically,  if  we  let  z,  be  the 
logarithm  of  the  likelihood  ratio  L  (x^)  ,  and 


i=i 

then  the  test  comparing  Z  to  the  boundaries  a  and  b  is  such  a  random 

A 

walk.  ..  .  A  solution  to  th?.s>  problem  should  consist  of  (at  least)  the  proba- 

bility  that  the  walk  ends  with  absorption  at  each  of  the  boundaries  a  and  b  , 
and  the  expected  length  of  time  (number  of  samples)  needed  to  do  so,  under 
each  state  of  nature  and  . 

Wald  (47)  has  shown  the  following  results  that  lead  to  a  partial 

solution.  Let  the  ^  all  have  the  same  p.  d.  f.  (z)  under  the  state  of 

nature  S^(h=0, 1}  .  We  define  E^(€)  to  be  the  expected  value  of  random 

t  z 

variable  ?  under  .  Then  if  4^  (t)  *  E^ ie  )  is  the  moment -generating 
function  of  E^M^O  and  g^(e)  >  0  for  some  z>0  arid  some 


%  <  0  ,  then  there  exists  a  t  *  such  that 


V‘h>  ■ 1 


(A.  1) 


and  the  following  fundamental  identity  may  be  proven 


Eh{«  n[\(t>]  }  -  1 


(A.  2) 


where  n  (a  random  variable)  is  the  length  of  time  until  absorption. 
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Now  let  us  define 


n  (b,  a)  =  prob.  {z  :£  b} 
h  n 

1  -  II  (b,  a)  "  prob.  {Z  2s  a} 
h  n 


(A.  3) 


where  Z  is  the  value  of  Z  at  termination,  and  we  have  taken 
n 

advantage  of  the  fact  that  the  process  terminates  with  probability 


We  can  now  use  (A.  1)  and  our  knowledge  of  (z)  to 

calculate  t,  .  By  letting  t  =  t,  in  (A.  2)  we  also  obtain 
h  h 


E  (efck  ^n)  =  1 

h 


(A.  4) 


Now  let  us  suppose  that  when  the  walk  terminates,  it  does 
so  at  exactly  Zp  =  a  or  Z^  =  b  (i.e.  we  neglect  any  excess  of 
over  these  boundaries).  Then  we  may  write  {A.  4)  with  the  aid  of 
(A.  3)  as 

bth  ath 

Hh4b,a)e  +  [1  -  Hh(b,a)]  e  =1 


or,  solving  for  h^(b,  a) 

u-b)th  -bth 

h  . 
e  -  1 

Using  the  same  approximation  we  may  also  write 


(A.  5) 


£h(Zp)  =bnjb,a)  +  a[l  -  Iljb.a)] 
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and  we  may  easily  show  that 


Eh(Zn> 

V“>  =  w 


(A.  6) 


Identifying  H^(b,a)  with  1-p^  and  II  ^  (b,  a)  with  1-p^  leads 
directly  to  equations  (2. 13). 


The  above  formulae  hold,  unfortunately,  only  for  the 

restriction  used  above - that  the  walk  ends  only  with  a  jump  exactly 

onto  the  boundary.  When  (a-b)  is  very  large  compared  to  the  values 
of  x^  ,  then  we  see  that  this  assumption  is  reasonable.  However 
in  the  general  case  considered  in  Chapter  III  we  cannot  guarantee 
this  because  we  cannot  tell  beforehand  the  values  that  these  boundaries 
will  assume. 


Perhaps  a  more  direct  way  of  deriving  the  H  (b,a)  is  to 

h 

use  a  Chapman -Kolmogorov  equation  to  describe  the  walk  in  going 
from  one  step  to  the  next.  Thus 


The  solution  for  this  equation  is  not,  in  general  known.  A  most 
exhaustive  and  interesting  study  of  such  problems  has  been  recently 
undertaken  by  J.  H.  B.  Kemper  man  in  The  Passage  Problem  for  a 
Stationary  Markov  Chain,  University  of  Chicago  Press  (1961). 


B.  Decision  Regions  For  C  (P)  When  WQ  or  W^  Are  Zero 

Theorem  D  has  been  proven  for  the  case  where  the  experi¬ 
mental  costs  Wq  and  Wj  are  non-zero.  We  demonstrate  here  the 
proof  for  the  condition  Wq  =  0  or  W^  =  0  . 


1.  Let  us  take  "Wq  =  0  ,  >  0  .  Then  by  step  2  of 

theorem  D  we  still  have  6  ^  A<1  . 

n 


for  any  y>  -oo  . 

3.  (?)  above  becomes  a  strict  inequality  if 
Let  us  assume  this  is  so. 


y 

Jg(x)dx>0. 

-CO 


4.  Let  y  be  the  solution  to 

Pl  (y)P 

Pj  (y)p  +  PoMt1-*3)  ~  6n~i 

so  that  y  =  y  (P,  6  )  . 

n- 1 

5.  Let  7n=»Pin(2)  above.  By  (4)  and  the  fact  that 
G„lP)  *  T„C01  ■  w*  *'* 


TnC01  >  'n  W1  +  (1-\)C10  I  p0Wdx 

6.  Since  S'  A  by  theorem  D^, (5)  becomes 
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7.  We  postulate  that  y  >  P'=rO  .  Then 


n 


«  >y(P*,A)  >  (r  *  A) 
n 


and  (6)  becomes 
.00 

{ 

C 


10 


1'n> 


I  p0  (X)  * 

y  (P A) _ 


.  00 


C01  -  W1  +  C10  I  V*>  dx 

y  (P A) 

8.  By  letting  the  right-hand  side  of  (7)  be  equal  to  P  ’  ,  we 

note  that  P  1  =  0  is  a  solution,  and  we  have  proved  that 

lim  y  -  y  >  0  . 
n~*oo  *n 

If  the  assumption  made  in  (3)  does  not  hold  then  the  proof  is  invalid, 
and  in  fact  7  =  0.  A  similar  proof  is  followed  for  the  condition 
W0>°,  Wj-0. 


C.  Flow  Chart  of  Computer  Program 

We  present  in  Figure  C.  1  the  flow  chart  of  the  computer 
program  that  iteratively  calculates  C(F)  by  use  of  equation  (3.  3). 
The  programs  used  to  evaluate  other  quantities  in  this  work  are 
similar  in  structure  and  so  are  not  shown. 

The  x-axis  (where  x  is  the  observed  random  variable) 
extends  from  -4  to  +12  and  is  represented  by  161  points  .  1  apart. 
The  P-axis  goes  from  0  to  1  and  is  represented  by  101  points  .01 
apart.  It  is  assumed  that  f^(x;  0, 1)  =  0  for  x  <  -4  and  x  >  4  . 
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Cg  is  the  initial  value  for  the  iteration  and  is  read  in  for  each 
calculation.  To  evaluate  C  (P)  when  (.  01)  n  <  P  <  (.  01)(n+l)  ,  where 
n  =  0, 1, ....  99  ,  the  linear  approximation 

C(P)  «  C  [(.  01)  nj  (n+l-J00P)  =  C  [(.  01)(n+l)]  (n-lOOP) 


is  used,  which  is  exact  when  C  (P)  =  T  (P)  . 


The  integral 


r°°  / P,(x)P\ 

j8|l!,Cn-l  (  W)  ^ 


is  approximated  by  the  sum 


which  produces  for 


~co  -ao 


the  value  .9994. 


1 25 


*•* 
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